Docket No.: CL001103CON 
Serial No.: To Be Assigned . 
inventors: Gennady MERKULOV et al; 
Title: ISOLATED HUMAN TRANSPORTER ... 

1 CaSCAACCCC GACQGCGCCC CAAAGGCTCT TGOSCOGCJGC GCCCOGCCCA 
51 GCC03GCCTC GCGCTGGrCC CGGTCTCGCC COGCAGGCCT CGATCTCCCG 

IDI TCAcrrccrc qcjCOvqqcgg ccrecx5ccrc toggaccatc TrccjGcreGC 

151 TGCXaQGAGT C3QCQCTCCCC ACOGGQQCCr GCCAGGAC3GC CGfiiXfiCCCG' 
201 AaXTGCTAO; AGACCCrCTT CCAGGCACTC GACCGOWVTXi GGGACQGAGT 
251 QGTX5GACATC GGCGAGCTGC AQGAQGGGCT CAQGAACCTG QGCATCCCTC 
301 TQQGCCAGGA CGCOGAGGAG AAMTTTTTA CTACTQGAGA TCTCAACAAA 
351 GATOQGAAQC TOGATTTrcA AGAATTTATG AACTACOTA AAGACCATGA 
401 GAAGAAAATC AAATTQQCAT TTAAGAGnT AGACAAAAAT AATGATOGAA 
451 AAATPGAGGC TTCAGAAATT GTCCAGTCTC TCCAGACACT GGGrCTG^CT 
501 ATTTCTGAAC AACAAGCAGA GTTGATTCrr CAAAGCATTC ATGTTGATQG 
551 GACAATGACA GTGGACTGGA ATGAATGGAG AGACTACTTC TTATTTAATC 
601 CTGTTACAGA CATTCAGGAA ATTATCCGTT TCTCGAAACA TTCTACAQGA 
651 ATTCACATAG GGGATAGCTT AACTATTCCA GATCAATTCA CGGAAGAC3GA 
701 AAAAAAATCC GGACAATQGT GGAGGCAGCT TTTQGCAGGA GGCAmOCTG 

751 GTocrcrcrc tcjgaacaagc Acreccccrr togaccgtct gaaaatcatg 

801 ATGO^jOTTC ACX3OTTCAAA ATCAGACAAA ATCAACATAT T TO G TCG CrT 
851 TCGACAGATC GTAAAAGAAG GAGGTATCC3G CTOGCnTQG AGQGGAAATC 
901 GTACAAACGT CATGVAAATT GCTCCTGAGA CAGCfGlTAA ATTCTGQGCA 
951 TATCAACAGT ACAAGAAGTT ACTTACTGAA GAAGGAGAAA AAATAGGAAC 
1001 AtTTGAGAGA TTTAI I ICIG GTTCCATGGC TQGAGCAACT GCACAGACTT 
1051 TTATATAT<jC AATQGAGGTT ATGAAAACXA GGCreGCTXJT AQQCAAAACr 
1101 GGGCAGTACr CTQGAATATA TGATPGTTQCC AAGAAGATTT TGAAACATGA 
1151 AGGCTTGGGA GCII I I lACA AAQGCTATCT TCCCAATTTA TTAGGTATCA 
1201 TACOTATGC AGGCATAGAT CTTGCTUTUT ATGAGCTCTT GAAGTCCTAT 
1251 TGGCTQGATA ATTTTCCAAA AGATTCTCrA AACXXTCGAiG TCATOGTCTT 
1301 GCTQQGATQC QGrQCCITAt CCAGCAOCTG TOGTCAGCPS QCCAG6aCC 
1351 GATTQGCTTT GGTGAGAACT GGCATGCAQG CTCAAGCOVT GTTAGAAGGT 
1401 TCCCCACAGC TGAATATQGT TCGCCrCTTT C3GA0GAATTA TTTCCAAAGA 
1451 AQ6AATACXA QGACTTTACA GAQGCATCAC CCCAAAOTC ATGAAGGTOC 
1501 TCCCTQCTiGT AGGCATCAGT TATGTCGTTT ATGAAAATAT GAAGCAAACT 
1551 TTAGGAGTAA CCCAGAAATG ATCTPGCATT TtTTGCnTA GCCTGATAAT 
1601 TGAAACTTTC AACAATCTCT GGAGrGACTT TtTCTCCTCG AATTGAAACA 
1651 AGTCTATOGC AAAAGAAGCT GCAI I I I 1 1 I. CACAAAAGGG AAGACDGGTAA 
1701 CAATQGTCAC TTCAAACrn: TQGGCTAAAT TATATGTACA CAGAAATUTT 
1751 CAAAATCATA GTTTTAATGT: GTTTTCAAAA GGCCACACAA TTATACTTTA 
1801 TCI I I ILI lA ATAATCCreC AAATCTCTCC CCTCAATCCG AAATGTTGAAA 
1851 ATGTACTCGC TTCAACAAAA I I IGI I I Ibl GPGTTAGAGr TATAAATCAT 
1901 TAATCTTTAT TTCiQQGIGGrr TTACXJnTAT GGCAGTTGCT TTATATTTAA 
1951 Al I ICIIGII TTATATATTT TGAAIGICI I TATAGATTTt TTTAAATTTC 
2001 CTTATAGAAC CATTAATAGA AAATGATTAC ATTTAAAATA TACCTTACAG 
2051 CAAAAGCATC CAAATAAGTA TAGGGnTAT GTCCTTATTT TTCTTTCAGC 
2101 TGAATACiGAA TGAACACAGT GGTGGAATTT CTGAAGGGAA GTCATCAAAT 
2151 TATATTTATT TCAGTOGGCA CTTTTCO\TT TTACCACTGr ACCATTATTT 
2201 QGTTCrraGA GTTATACACr AATTTTONGT ATATTACTGT TAAATTACCA 
2251 ACACAAGGCA ATTTATTTGA AAGATTCQGT TTATCCTGCC ATTGCTTrcA 
2301 AAAGCAGCAG GAAACGAAAT I I II IGACTT GTATCAGCTT CTGCAGAGCA 

2351 TLMiGiiii ccrrrcrccr TrcnrccrA ccmrGAAT cagattccgt 

2401 TTTACTO^iG AAGACI ILI I QGGACCATTC TTAGTAACCT GAAAI I ILI I 
2451 TTTTAATPQC ATGAAGTQGA TTGATCATGA GCAAGTGATG GGCTTTATTT 

2501 crcccrcAcr ggtcaatatc cnTGAAor gligi i igca atatcggcag 

2551 CCACAAAGQG ,GGAGAGATCC CTATTAAATC GGGGGGGTCT ATGAQTCre 
2601 AAAACATTCG ATAOXTATT TTCAAAAGGG AAAGGCCCAA TTTGGGGAAA 
2651 CATAtACCM TQCATGAtTT CTC (SEQ ID N0:1) 
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FEATURES: 

5 'urn: 1-137 

Start CDdon: 138 
stop Godon:. _1569 

3'my:; : 1572 ' 

HOMOLOGOUS PROTEENS: 
TOD BLAST Hits: 



CRA 
CRA 
CRA 
CRA 
CRA 
GRA 
(iRA 
CRA 
CRA 
CRA 



335001098641184 /altid=gi 111360341 /def=pir| |T50686 peroxis. 
11000479457833 /altid=gi 16841066 /def=gb|AAF28888.1|AFl2330. 
18000005183605 /altid=gi 1 7504235 /defH)i r | |T22688 hypotheti . 
1000682325160 /altid=gi 17499323 /def^3^•^| |T21074 hypothetic. 
89000000196990 /altid=gi 17294582 /def=gb|AAF49922.1|. (AE003. 



150000075553401 /a1tid=gi 
335001098657884 ./altid=gi 
163000046661776 /altid=gi 
105000014652720 /altid=gi 
335001098655048 /a1tid=gi 



9758252 /def=dbi |BAB08751.1|. (ABO 

11358611 /def^)^^ -'- 

10176874 /def=dbj 



10798831 /def=db3 
11277065 /defi=pir 



IT49871 peroxis. 
BAB10081.1I (AB. 
BAB16462.1I (AP. 
IT47703 Ca-depe. 



Score 
927 

. 834 
432 
377 
348 
339 

. 330 
326 
. 200 
199 



BLAST dbESt hi ts : 



gi 

91 
g-i 

91 



EXPRESSION INFORMATION FOR MODULATORY USE: * 

library source: ' 

Expresision infomiatioh from BLASPdbEST hits:. 

gi 110145202 placenta Chondcarci noma 

gi 11437155 Retina 

gi 110333851 uterus leiomyosarcoma 

gi 18469752 Breast 

gi [11684041 Ovary fibrptheoma . , 

Expression information from PGR-biased tissue screening panels: 
Leukocyte 



E 

0.0 

0.0 

e-120 

e-103 

9e-95 

5e-92 . 

2e-89 

4e^88 

3er50:: 

6e-5G 



10145202 /dataset=dbest /taxbn=96 . . ?. 1108 0.0 . 

1437155 /dataset=dbest /taxon=9606 ... . ' . 801 0.0 

10333851 /dataset=dbest /taxor^=96 • : 745 0:0 ' 

8469752 /dataset=dbest //taxon=^. .-. . 363 8e-98 

11684041 /data5et=dbest /taxon=96J . ; ' 307 . 4e-8i 
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. 1 MLRWLRDFAL FTMGQDAEQ PTKYETLPQA LDRNGDGWD IGELQEGLRN 
51 LGIPLQQPAE EKIFTTCIMJ KDGKLDFEEF MKYLKDHEKK MKIAFKSLDK 
101 NNDGKIEASE IVQSLQfn-GL TESBQCJAELI LQSIDVDGTM TVDWNEWRDY 

■ 151 FLFNPVTDIE EHRFWKHST GIDIGDSLTE PDEFTEDEKK SGQaWROLLA 
201 GGIAGAVSRT STAPLDRUa IVMqVHGSKSD^^I^ 
251 WRGNGT>JVIK lAPETAVKFW AYEQYKKLLT EEQQKIGTFE RFISGSMAGA 

. 301 TACJTFIYPME VMKmLAVGK TQQYSGIYDC AKKILKHEGL GAFYKGYVPN 
351 LLGIIPYAGI DLAWELLKS VWLDNFAKDS VNPGVMVLLG C3GALSSTOQQ 
401 LASYPLALVR TRMQAQftMLE GSPQLMWVGL FRRUSKEGI PGLYRGITPN 
451 FMCVLPAVGI SYWYEI^«Q TLGVTXJK (SBQ ID NO: 2) 



FEATURES: 

Functional ciomains and key regions: 
[1] PPOClOOOOl PSOOOOl ASNJGLYCDSYLATION 
N-glycosyTation site 

254-257 NGTW (SEQ ID NO: 7) 



[2] . PDOC00005 PS00005 PKC_PHOSPHO_SrTE 
Ptx)tein kinase C phosphorylation site 



Number of matches: 2 

.1 229-231 SDK 

2 475-477 TQK 



[3] PDOO00006 PS00006 CK2J'H0SPH0jsiTE 

Casein kinase n phosphorylation site 



Number of 
1 

2 

. . . ,-. . 3 

4 

5 

6 

7 

8 



matches: 
22-25 
65-68 
. 121-124 
157-160 
170-173 
179-182 
185-188 
227-230 



8 

TRYE 
TPGD 
TISE 
TDIE 
TGID 
TIPD 
TEDE 
SKSD 



(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 



ID N0:8) 
ID N0:9) 
ID NO: 10) 
ID NO: 11) 
ID N0:12) 
ID NO: 13) 
ID NO: 14) 
ID. NO: 15) 



[4] PDOC00008 PSOOOOS.MYRISTYL 
N-myristoylation site 



Number of matches: 



. 1 
.2 
3 
4 
5 
. 6 
7 
8 
9 
10 
11 
12 
13 
14 



52-57 

119-124 
171-176 

201- 206 

202- 207 
245-250 
253-258 
283-288 
295-300 
322-327 
326-331 
359-364 
392-397 
399-404 



16 

GIPLGQ 
GLTTSE 
GIDId) 
GGIAGA 
GIAGAV 
GGIRSL 
GNGTNV 

qqkigt: 

GSMAGA 
GQYSGI 
GIYDCA 
GIDLAV 
GALSST 
GQLASY 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 16) 
NO: 17) 
NO: 18) 
NO: 19) 
NO: 20) 
NO: 21) 
NO: 22) 
NO: 23) 
NO: 24) 
NO: 25) 
NO: 26) 
NO: 27) 
NO: 28) 
NO: 29) 
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15 442-447 GLYRGI . (SEQ ID NO: 30) 

16 . 446-451 GTTPNF (SEQ ID N0:31> . 



[5] PDooooois psobois ef_hand' 

EF-hand calciunHbinding dcxnain 
Nurdser of matches: 3 

1 32-44 Diy^JGDGVA/DIGEL (SEQ ID NO: 32) 

2 . . 68-80 DVNKDGKLDFEEF . (SEQ ID NO:33) 

3 . . 99-111 DKNNDGKIEASEI . (SEQ ID NO:34). 



Membrane spanning structure and domains: 
. . Helix Begin End Score certainty 
. 1 292 312 1.053 Certain 
... 2 345 365\ 0.613 Putative 
3 . 381 401 1.544 Certain 
.... 4 446 . 466 0.733 . Putative ' ^ 

BLAST Alignment to Top Hit: • 

>CRA| 335001098641184 7a1tid=gi 1 11360341 /def=pi r | jT50686 peroxisomal 

Ca-dependent solute carrier [imported] - rabbit 

/or9=rabbit /taxort=9986 /datasel>=nrala /Iength=475 

Length = 475 

score = 927 bits (2371), Expect = 0.0 

Identities - 454/477 (9590, Positives = 466/477 (97%), Gaps = 2/477 (05O 
. / . ■■ ' ■ - 

Query:. 1 MLRWLRDFALPTAAO^PAEQPTRYETLRJALDRNGDGVVDIGELQEGUWLGIPLGQPAE 60 

MLRWLR F LPTAAGQ AE PmYETLFQALDRNGDGWDI ELQEGL4+LGIPLGQPAE 

Sbjct: 1 . MLRWLRGFVLPTAAGQGAEPPTRYETLt=QALDRNGDGV^ 60 

Query: 61 EiaFTTGDVNKDGKLDFEEFMCYLKDHEl^^ 120 

ElaF^^GDVNKDGI<LbFEEF^11C«^l<DHEI<l^ 

Sbjct: 61 ElaFTTGDVNI<DGKLDFEERV|lC»^KmEI<KMKLAR^ 120 

Query:. 121 TISEQQAELILQSIDVDG^^*4TVDM^^EWW)\^ 180 

TESBQCJAELILQ5ID KHNnVDWNEWRDYFLFNPV DIEEI3RFWKHSTGIDIGDSLTC 

Sbjct: 121 TXSBQCyVELIlijSIDApGTMTVDWNEM^^ 180 

Query: 181 PDEFTOEKIGGQMifllQLUVGGIAGAVSRTSTAPLDRLiaM^^ 240 

PDEFre+E^^GQQWIflRQLLAQGIAGAVSRTS^APUDRLK4^^^^ . MNIFQGFRQ 

Sbjct:. 181 RJEFT^EERKSQQiMflRQLLAQGIAGAVSRTSTAPLDRLKNM^ 238 

Query: 241 IWKEGGIRSIJMlGNGTNVIiaAPErAVKFWAYEQYK)^ 

l«HXEGGHlSLWRas|GTNVIKIAPErAVKFW YEQYKKLLTEEGC^aGTFERFISGSMAGA 

Sbjct: 239 imEQGVRSLIflRGNGTlWIiaAPETAVKRl\A/YEQ^ 298 

Query: 301 TAQTTTYPMEVMiaT^VCXraQYSGIYDCAKiaLKHEGLGAFW 360 

TA(irFIYPI*€VMI<mAVGICTX3QYSGIY^^ GAFYKGWPNLLGIIPYAGI 

Sbjct: 299 TAQrFIYPMEVNIKTRLAVGICrajYSGrrtXAKI^ 358 

Query: 361 DLAVYELLKS^V/LDNFAKDSVNPGVIWLLGaSALSSTaKJLASYPLALVRTT^^ .420 

DLAVYELLKS4WLDNFAKDSVNPGV+VLLGaALSSTaXJLASYPLALVRnMJ^^ 

Sbjct: 359 DlAVyELLKSHWUX^FAKDSVNPGVLVLLGGGALSSTOQQLASYPLALVRTIMy^ 413 
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Query: 421 gsk}IJn|vivglfrriiskegipglyrgxtpniwvlpavgisyv\^^ 477. (residues 1 

477 of SEQ ID N0:2) r 

) GfPQU^^IVGLFRWJSKEGf PGLYRGTTTNmK^ 

Sbjct: 419 GAPQL^^^/GIJ=^ym5KEGLPGLYRG^TPNF^II<VLPA^^^ 475 
(SEQ ID N0:4) - - - - - . . 

>CRA 1 11000479457833 /al ti d=gi 1 6841066 /def=gb | AAF28888 . 1 1 AF123303JL 

(AF123303) calcium-binding transporter [Homo sapiens] 

/org=Homo sapiens /taxon=9606 /dataset=nraa /lengtb=411 

Length = 411 

' score = 834 bits (2132), Expect = 0.0 
Identities = 409/410 (9950,. Positives = 409/410 (9990 

Query:. 8 . FALPTAAajPAEQPTRYETLFQAUDRNGDGWDIGELQEGU^NLGIPLGQ 67 

F LPrAA<X|DAa5PmYEn.R5WJDRNGDGWDIGELQEGIJWLGIPLQQDAE 

Sbjct: 1 FVLPTAAOJPAEQPTRYETLRiWJWNGDGWDIGELQEGI^ 60 

Query: 68 DVNI<DGiaJ)FEEFM101J<DHEKKMK^ 127 
DVNI<DGKLDFEEFMI<YLJ<[)HEKKMI<U^F^ 

Sbjct:. 61 . DVNI<DGiaJ3FEEFMlCVlJ<DHEKKN0^^ 120 

Query: 128 EULQSIDVDGTMTVDWNBrt^DYFLFNPVTDIEEIIRF^^ 187 

ELILQSIDVWnMTVDWNEWRDYFLFNPVTDIEEIIRFWHST^ 

Sbjct: 121 ElJ:LJQSID^«;TMTVDWNEWRDYFLF^ 180 

Query: 188 EKKSQQWaIRQLIASGIAGAVSICTSTAPIJJRU^^ 247 

EI<l<SGQWARQLUVGGIAGAVSRTS^APLmLXI^^^ 

Sbjct: 181 EKKSQQIMslRQLUVlKIAGAVSRTSrAPLDRIJ^^ 

Query:. 248 RSLWRGNGTNVliaAPErAVI^^ 307 
RS^UfrtlGNGrrWIiaAPErAVKFWAYEQyi^ 

Sbjct: 241 RSljy^GNCTTNVIiaAPErAVKFV^^ 300 

Query: 308 PMEVMKtRU\VGIGT3QYSGIYDCAI^ 367 
PMEVMJORLAVGKTCQYSGTrtXyVKiaU^ 

Sbjct: 301 PMEVMKTRLAN^SiaGQYSGIYDCAKiaLKHEGLGAFYKGWPN^^ 360 

Query:. 368 U<SVVflJDNFAKDS\/NPG\M/LLGC)GAL5STCX3QLASyPLALV^^ 417 (residues 8-417 of 
SEQ ID N0:2) 

U<SVWlI)NFAI<DSVNPGVMVLLG0GAL5STa3QU^ 

Sbjct: 361 LXSYVflJ>JFAKDSVNPGVMVLLGGGALSST<3QQIJ\SYPl^\^^ 410 
(SEQ ID N0:5> 

Score = 80.0 bits (194), Expect = 6e-14 

Identities = 80/388 (2090, Positives - 156/388 (3990, Gaps = 59/388 (1590 

Query:. 95 FIGLI)K^M)aaEASEIVQSLQTLGLTISEQQAELILQSIDV--DGTMTVD«N 152 

FHlJ>fN DG ++ E+ + L+ LGf + + . E I . + DV . DG + 

Sbjct: 21 RJAUDRNGDGWDIGELQEGUWLGIPLQQP^EEKIFTTGDVNK^ 68 

Query: 153 R^PVn)IEEIIRR«fl<HSTOII)IGDSLTIPDEFTH)EKKSQQy*^ 212 

, . . D EE +++ K . + EKK -H- L + . 

Sbjct: 69 —DFEEFMKYLX DHEI<KMKUVFKSlI)KrM)GiaEASE^ 105 

Query: 213 APUDRLIOMvqVHGSKSDKMNttFGGH^Q^^ 272 

L L + + .-HT +1 . .V R.+ N I E. -H-FW +^ 

Sbjct: 106 QSLQTLGLTlSEQCy^LJLQSIDVrtXjmTVDW^ EEIIRFWKH161 
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Query: 273 EQVKKL LTEEOQiaGTFER-FISGSmGATAanTmviEVMK^^ 321 

+ TE++KG + R-HC +AGA -hT . Pt+ +K + V G . 

Sbjct: 162 STODIGDSLTIPDEFTEDEKKSGCJaWRQLU^^ 

CMery : 322 GQYSGIYDCAKiauoTEGLGAF^ 381 

1+ ++++K .G+- + +4G..WH-IP + YE K ++. 

Sbjct: 222 SDKM^RXSFRQMVKEGGIRSIJWRGhONN^^ 277 

Query:. 382 NPGVMVLLGa3AL5STCQQU\SYPLALVRITMy^ 441 

(5 Qi 1 1|. Q YP+ +++TR+ A+ + ECrh 

sbjct: 278 laGTF^ERFISGSM^VGATAQrF^ 334 

Query: 442 GLYRGTTPNFMKVLPAVGISYWYENviK 469 (residues 95-469 of SEQ ID N0:2) 

Y-H3 PN + ++P GI VYE +K 

Sbjct: 335 AFYKGYVPNLLGHPYAGIDLANA^LK 362 (SEQ ID N0:6) 



Hrmer search results 
Model Descn'otion 



(Pfam); 



Score 



E-value N 



PF00153 
PF00036 
PF00404 
PF01978 



Mitochondrial carrier proteins 305.4 

EF hand ^ 50.7 

Dockerin domain type I , ^ , : . • . . 9,7 

Protein of unknown function 2.7 



Parsed for domains: 



hrm-f himi-t 



score E-^va1ue 



PF00036 


V3 


27 


51 .. 


. 5 


29 .] 


. 18.7 


0.002 


PF00404 


VI 


67 


. 85 : . 


1 


. 22 [! 


9.7 


0.26 


PF00036 


2/3 , 


61 


87 . . 


3 


29 .■ 


19.7 


0.001 


PF00036 


.3/3 


90 


118 .. 


~ 1 


29 [] 


17.2 


. 0.0051 


PF01978 


.. i/i; 


.110 


121 


,1 


.13 [. 


2.7 


. 9.5 


PF00153 


.1/1 


193 


472 . . 


1 


313 D 


. 305.4. 


. 3e^88 



3e-88 
1.7e-12 
. . 0.26 
. . 9.5 



1 
3 

1 
1 
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1 MCCCATUTT AGTCTGCAGT: TCTCCTCGCA CACACATGCA GTTGTGrAAC 
. 51 CACTACCACC AAAAGCAAGA TCTAAAATAG CTCCATCACC CCCACAAGCC . 

. IDI TTCTWGcr cmrcrcAT caattccot ccggctagtc ACAAcrairA 

151 ACTACTCATT I G I 1 1 ILKa l CCCTATAGTT TTCCCmTC CfiG^TGTCA 
201 TTCTTCACAG CTATCAGTAA TTCATTCGT TTTATPQCTA ATTACTATCT 
. 251 CACTCTATCA ATGb\ACACA Gbl Idl I lAC CAGITCACCC GTTAAAGAAC 
. 301 Al I I IGI I IC TGCGCTTGAC AGTTATGAAT AGAACTGCTA TAAACCCTCA . 
351 AGTAAAAGTT TraGTCTGAA GATAATTTTC TOVGCAAAAA CXXTCACAGG 
401 TAATTTTTCT AACTATTACT TTTTTAAAAA AGTAAAATAG CCTUTAQCCC 
451 CAGCTACrCA GGAQGCTCAG GCAGGAGAAT AGCTTGAACC CAGGAQQCGG 
501 AQGTTGCAGr GAGTPGAGAT TCTCCCACre CATTCCAGCC TQQGC3GAGVG 
551 AQCTAGACrc TCTCAAAGAA AAAAAAAAAA AATAAONAAT AAATAAAAAG 
, 601 TAAAATGAAA GCATCTAAGT GTAAGATGAC TAGTTCAAGC AACCTCTCTT . 
651 CAAGTACAGA GTATTCAGAG TAGAGATTAA AAGAGGTTTT CAAQ6ACAGA . 
701 GAAAATTTGA AGTTPGAAGG CAGTTCCAAA GGAAGGCAAT GATTCTTAAT . 
751 AAGACTOGAA GTTQGAAGTA ATATAAAAAG ATAAATCAGT TTCAAGATGA , 
801 TTTTAiCTAAG OVQQOVGOCX: TTAATTTACA AATTCTAGAT TOVTACArAT 
. 851 CTTAAACATA CAAAATGATA TGAGGAGAGG TAAGTTCAGG GTCrGAGTTC 
901 CreGCTGrnG TPOGAACTGA TTTCTGnGTA GTGATTCAGA AGATGTGAGA , 
. 951 CACCCTAATT TACAAGTACA GAGGTATCrT CTTTTCTGCA AACAGCAGTA 
1001 CAACAATAGT TCCrCTTAOG OVGCTCTCAA TGAACAQGAT TATTACAATT 
1051 AATGATATCr OVrrTGATTG GCQCCTTAGA GAATTAAGAC CTTTCACACC 
1101 TAATATACAA CmUTTGTG AAGGCAGATA TTTATATTCT CATTTTACTG 
1151 ATGAGAGACr ACCGGGAGAC GCTATGTCAC ACCTGAAGGA TTAG6TACTT 
1201 TCrCrXJlTAA GTCCAATOTT CCrTCCGnTA TTCCATGCTA GGOVGTAATA 

1251 AGTTCrcrCr TGCCrXi'VGrA ataagctcca aacctgqgaa ctgcacccat 
1301 cttgagaaqg aqgagqggqc tctggttttt tctcataagt govgctogca 
1351 gacacrctat acgcttaatc acgggcaaat cctacctaag ctgcctacca . 
1401 aactagtccr tcmtcccc gttgcccagg ongatcgctc ttgatcmt . 
1451 crccaacaaa tccaggagtt tctclm 1 1 1 gnttataat tcctxzovata . 
1501 gatgctttag gatttaactc tctgli i 1 1 i aaagcagaat cx3ccatccca 
1551 ggtgpgcaac cacgaaaaaa ttagacatcc gtx5agagaca atgccctcca 
1601 tqgcccagtt tccaggcaga gagaagcagc tctgggcrga ccgccaaggc 
1651 tccggcccga gaqqgtcnt aagtx3ga6ta acongtcttc aagaccx:c3gc 
1701 tcccaaqcca cggacggqct gaoqctgovg ccctogacct gctqggggcc 
1751 tcttccrcgg accggcatgc tgacagcgqg actggcaact gggcagaqgt 
1801 cgaccccgqg tccgcacagc acctcccgag acccagctcc cagctccctc 
1851 acttccggct ctctggaggc gggcccggcc agigccgccg aggccagcgc 
1901 qqcgagcrcc tccocagcag cqqcqqgacg gcovcaccct gcgggccggg 
1951 cqqqcrcqqg togggtctcc gctcctgggc cctcgggqcc qcagccqovc 
2001 ccccgacggc gccccaaacg cturpgcgcc gggcgccccg ccovgcccx3g 

2051 CCrCGCGCrc GTCCCGGTCT: OGCCCCGCAG CCCTCGATCT CCCGTlGACTT 
. 2101 CCrCGGGCAG GCCGCCTXiCG CCTCTGGGAC CATCTTGCGC TGGCTGC3GGG 
2151 ACTTGGrGCr GGCOVCOsGG GCCTGCCAQG ACQOQGAGCA GCGGAOaGQC 
2201 TAC3GAGACCC TCTTCOVQQC ACTQGACGQC AATCQQGACG GAGTOGIGGA 
2251 CATCGGCGAG CTGCAGGAGG GGCTCAGGAA CCTGQGCATC CCTCTOQGCC 
2301 AGGACQCCGA GGAQGTTQGGT CGCCGCCQQG GCGCCQCCTG AGCGTAQGGA 
2351 GGGCTQCGGG CDGCTCGGGAC ACTCCGAQGA CCGAGGAGGG CQGCGGCTTG 
2401 AQGCGnGCC AGGAGAGGAA QGAQGAACTG 7GQCGCCCAG CGCTCOiGrG 
2451 GCrrCAGAAA CTC3GGG0GTG GQGCOGCGAC CQGOSACCCC GGTAAOVGAA' 
2501 GTCGGTCATA ATAGGAAAGT CTACreGTAT TTXJTCCAGAT AAAATGAGPG 
2551 TPGTGGACAC TCTQGCCCAC QQGCACrcTT AAAtnTTAA GACACmTG 
2601 TCCTGAATCC ATCCXAQGTT C III G II I I C IGII IIAATA CCttGCAGAC 
2651 ATGTAATCGG nTTAGCTCr CAGAOTOVB TQQGTCCCAA GTTTTXjTATA 
2701 AAGGCGCACA CATTCGATCT CTTTCGAAGC TGLI I IGI lA CAGCAGCTAT 
2751 GTCTATTCTC TACTCTTTGA AAALIGI I IG AAAACCAATC GGblGI I ICC 
2801 CCCACTTCa GTTCAGAAQG AATQGCQQOiV 1T0CATTGTT TAAGAGVTTC 
2851 CTAQGTTAAT GCCCrAQGTA CATAAATPSA TCTGAAQQCT TGACTTGACC 
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2901 TTGCGACTCAG CAATTTCATT TTCTCreAGT CATGTMCT GrGCCCCPGA 
2951 ACTTCTCCCC CnTACTAGG GTX3GAGATAT GTGGAACTTC TCCAACCCTC 
3001 TTCAAGCGTT CCCTCACACr QGCATTCTCr TATCO\AAGA QOGAAAGrGA 
3051 TTAQGrTACr ATGAQQGCO\ ACAACTCTTA TATAGTTATA TTTCACTrcr 
3101 CTTTTAATCT CTTTCGrAGr TATAGGCCTC TTCAGTTTAC Ibl I ICI I C T 
3151 AGAGTCA6AT TTAGTAAGiT ACAAI I I I I I TTCAAACTGC CIGI ICICalC 
3201 CAAGGTTCAT AATACTCACC GATGATTTTA TAACACTTCT GACTGAATCT 
3251 GTAQGrAQGr TC TCTAT TTC ATTCCTCATA TCrATCCTTT TCTCCCCTTC 
3301 AATOTCCCA AAGTTTTCrc TATnTATTC ATACTTTCAA QGAAOCAACT 
3351 TTTCCTACrr TCTGCTCATT GTCCCAGAAA TOGCCCAGIT QGAGTTCCCC 
3401 ACOATCTCCA ATGATPQGCT GGAAQCAGCC CAQGAAAQQG ACGACOTCC 
3451 TGCAGTGCAT CAGCAGATGC CAGGCTTAGA GGCTAGAGAG TGGAAGTCAA 
350r OGTUTTCCr CACAGTAGGT GCCTTTGAAG GGAGATCTOA GTOGTACAAG 
3551 ICCATCCTCC CTACAATATA CAAAAQCTCT TTGGAGnGCT CAAtGATTTT 
3601 TAAGATTCTA AAGGGATCCT GAGATCAAAA AGCTPGAGAA TTGCTCCTCr 
3651 ATCACCATTT TTAOGTAACr GGATCATATT CTCTTATATG I I IdlGICAT 
3701 AGTATATCTT ACD^TTOT TTTAAATCAC CmTAClTT ATTCATAGTT 
3751 TAAAAACGAT TCTAAGreAA ATTGCAATCG ATGrCCTTPG TATrCATTTt 
3801 CTO\TTCraG TCCAGnACr TTCGTAGGAT AAATTTTCAG GAGTGGACAT 
3851 TGCPGAGTCr GAAGGTAACA CACATTTTAA ACTGQGATAC GTATTQCCrr 
3901 TCQGAAACCT TAGACCOATT TTCACTCTTT TGACTCACAG TGCrTGCTTC 
3951 TCCACATCXT OQCTCATTCA GGGTATCAGT CTTTCTAAAG TCTCCTATTC 
4001 TGCAGGTGAA ATTCCTTTTC ATTTCCTCTC tTAGT'CCATr TAGrGrPGCT 
4051 ATAGTTQGAAT ATCTGAGACA GGGTAATTTA TAAAGAAAAG ACATTTAT7T 
4101 AGCrCACAGT TCC3GCAQGCr GGGAAGTTTA AGAAGCGTGG TGCTGGCATC 
4151 TGCTOGACrc CTQQQGAGQG CrTTCCrQCr GTXJTCACAAC ATQGTCGAAA 
4201 GHXAAAGTCG AAGTCGACAT GTCTGAAGAA GCAAAATCGG AGGGGrGTrCC 
4251 TQGCntATA GCAACCCAGG CTCGAQGGAA CTCATCCATT ACTGAGGGAA 
4301 CTAATTCAGr CTCATGAGAG AGAGAACTCA CrCACTACre CAAGAATGAC 
4351 ACCAAGCCAT TCATCAQGGA TCTGCGTCCG TAACCCTCAC ACCrCiCnSCr 
4401 AGGTCC CTCC TCCO^CACG GCCACATCAG GGATCAGACT TCAACA7GAG 
4451 1 1 I I I GTGGG GACAAACAAA ACGTAGCACT TGCnTGCCT TTrGGTrcrA 
4501 TTCACATCCT CCACAGGATT GCATTATGCC TACCGATTTC GTGAGGGCAG 
4551 TCI ILI I lAA I IGbl I lACr GATTCAAATC CTACCCrCCT CCAGAGACAT 
4601 GCTCACAGAC ACAGCCAGAA ATCATGTTTT ACCAGrTATC TQQGCATCCC 
4651 TTAGTCCAGA CGAGITGATA CATAAAATTA AGCATCACAC ATQGGATAGA 
4701 ATTAGGATTA CACAGTCAAC OTTATGGGA GAAAATTTCA GAGGGATTJTC 
4751 AGQGGTTTAT GTAATCTCAA GGAGTGAQGA CATTCGCTAC TTGAGCATAG 
4801 AAATCAGAAC TG7X5QGGTGA CrCTTGQGre GAAAGnTOA AGGTAGrAGT 
4851 TTCTATCTAA QCCAAATACT CftGCTPGAAG CAAAATCTCT ATAAATTTTC 
4901 ATCTGATTTG ATCTCATCTC CGTUnTCCA AGCATTTCTA ATGAATTGAG. 
4951 CATTTAGAAG AGAACAAATT TCrGTTTAAG I I ICI I lAGA TTTTAGATGG 
5001 AAAGAATGTA GAAATAAGAG TAGAATCTAG AAATAQGTAT AAAGAATATA 
5051 ATAGCTAACG ATTACTAAGr GTTCCAGAAT TATOCAQQGA AGAGAAAAGA 
5101 ATTCAAGGOA AOTCCTCAGA OVAAAtTAAG AACXAATTQG AAGTSAAAGC 
5151 GCTACATTTT I I I I I ICTGG TATGACCTTT CmTCTATA TCTTCCAAAT 

5201 crccrcACTA tgaaattagt gaaaaattaa agttaaaaat tagagaaaat 

5251 TCACATTAAG TTCTCCTAQG ACTCAGTAGT ATAAGQGTAT AGACPGAGAG 
5301 TAGAATCTAG TOTGAGAACA AQGAGATACA GTATTTAACC ATTACTAATT 

5351 droTATAcr tctctagtaa TCCTATrrcc ttttaaaagt cttcagttat 

5401 TTTCTCrnA CGCACCrCCr TCrCCCTCTT GTCTTCCrCC ttoaccccc 
5451 ATCI I ICI IC CTCTOGAGCC TTCATGAATC QGATTAG7GC TTCTATAAAA 
5501 GTCACCTQGA AGACXTTCCT TCCCCCTTCC ACCATCTCAG GAOVCAGTGA 
5551 GAAAACAGPG GTCCATQGAA CCGGAAAGTG GGTCCTCACT AGACAGTAAA 
5601 TCrCCTAGCA CTTOGATCTA GGACTTCCAG TGTCTGGAAC TCCAAGAAAT 
5651 CAATGOTAT TGnTAAGTA AGCCAGTAGT Al 1 1 I IGICA TAGCAGCCCA 

5701 GrrciGAcrAG gacaattacc aagagcaaga aqqgaagcag caagctacaa 

5751 GAGAGTT0C3G TCCI lOblGI. AAATTCACXG TGTAATCXTT GTCAAOTTTC 
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5801 AGCOTACre GAGCTTTACr TTCTTATTCT TAAAATGCAG ATATOTGCC 
5851 TGCATCCTQG ACAGAGGnT TAACAAQCTC ATATGTTGCA GAATATGAAA 
5901 GTTCATGTTA AAAAACCCTT TAAAATCTQG TATCCCATTT ACrAGCTOCT 
5951 GAACTTCrrc AQGAACCrCT GTCCOCATQG GTATGAAGTC TATCCTGAAT 
6001 GATCACCCAA TCnAGAGGA GTOQGTOGAC TCGTAACCTC ATTTAAQQGC 
6051 CATTCTAACT CTTACATTCT'ATGAI 1 1 U I TAATTCTOTC mAAGnTt 
6101 TACATTTACA ATCACAGAAA AAATACTCAC ATAGAAGAAT AGTAGCTTAG 
6151 CAAATCTTTA TTCCATreAG TQGAATOVQG ATTTOVCTCC ATTAAGTAAT 
6201 TCCrCTXnTA ACAAAGAGQG TKATTTCAT TnTATTTCA TTAATATTCC 
6251 I I 1 1 I I I I I I 1 1 I 1 1 ICPGG AGACAGAATC TTQCTCrATC ACCAAGGCTG 
6301 GAGTCCAGTG GrGGGATCTC GGCTCACreC AGCCrCTGCr TCCTQGATTC 
6351 AAGCGATTCr TUTOCCTCAG CCTCCCAAGC AGCTCAGATT ACAGGONCAT 
6401 GCCACCACAC ClUsI lAACT TTTCTATTTT CTAOTAGAGA TGGGAI 1 1 Id 
6451 CCATCTTCGr CAQGCTQGrC TTGAATTCCT GGCCTCTAGr GATCTCCCTX3 

6501 ccrcrcccrc TCAAAGnGcr aagattacag gcatgagcta ccatqgccag 

6551 CCCATTTCCT TAATATTTTA ATTCTCAGAC ATGTTATQGT TTCnOGCACA 
6601 ATAUAAGAA GACATGATAt GAAATCACAG GGnGAATTTT AGGGCATCAC 
6651 AACAGAAAGA TTATGGTATA AGAAAAACAA TCGAATTCCA ACTACATTTC 
6701 TUrCAAATGT TCTAAAATAT ATAAAATCTG TATCrTTTGr GTTCrCTCCT 
6751 GATTTATATT CrAAATTTGA TCTTATCCTT CrCTGG\GAA ATAAAGrcTC 
6801 TGAAAGAATG AAAAAAATQG AAGAATTCTT TAGTAAQGTA TAAAATACCG 
6851 TmCTATGT TGrAGCATTC TAAGCCmT GrTCACCFmC ONAACTCCCA 
6901 ACATGCCATA TTCCCTGACT AGGCCACAGC CATCTACATT GATCCCTTTA 
6951 IlllCI ICrC TCTTGCCPGAG ATTTCrCTGA TrCCCCCTTC TCTGCCTGGT 
7001 ATATGATTGC CCATTCTTTA AGGCCCCAAC TCACCTTTAT AATCTTCCTA 
7051 GCCCACTTTC TrrATCQGTA TTOCAGAAAA AAONAAAGAA GCTTCCACAA 
7101 GACAACATTC TUTAATACAC TGCTTAACTT CTTTTCACCC TGCreAGTTC 
7151 AAAAATOTA TCI 1 1 1 lAAG GATTGAATCG AGTCCACCAA GGTATCTATA 
7201 TTTGACAGGA TTTATGAAAA CAAAAQGATT TGrTTGAGAAA GTITGAAGCC 
7251 TAACrCTGAA ACGTCGATCA TAGTUnTAC TACACATTAA CTGTTTTAGT 
7301 GGATGTAATA GTTATTATTA TAGGCrGPSG AATCAGAACA GGGTTCAAAT 
7351 GTTTTCACC3G CTTGCrAGAC TCrGGCCTPG GGCATCTTAT TTAATGCGTG 
7401 GAGGCCrCAA ATGTTAACTA GGAATQGTAA GACCTACCCA GTAAOTAGC 
7451 ATAAATAGTA AATTCATTOA nTAATCTTT TGAAACAGTG CCAGACATPG 
7501 TTTAATGAAC TCQQGATATA GTGGTGAACA ACACTCACAG OdWU ItAT 
7551 TGrATTCrCA AAACCCTCCC TATAGTAAGT AGGTCTCTCT GTTJTCTGTAG 
7601 GTCCATQQGG AATAAAAAAT AATAAGCAAA TAATGAACAG GGTAATTTCA 
7651 AAAAGCAGAA AGAGCTATTC AACAAAACTA CCreCCTTTT ATTAGATGAA 
7701 ACTCTCAACT CTATOGTrTG TTCTCTCCTC TCAATTCrer TAAATQCTCT: 
7751 CAGC CICil 1 1 TCCTTATCAC COnOGCOWDG A LML I b l C T I I I C I U-MG 
7801 GTCCTUTAGA CTCTAACCCA AGGCTCATTC TCTGCGPQGC TATCTGCCTT 
7851 CPCTQGCrCr TTGCCACrAC CTACATTTTC TGRJITGCAC AGQGAAQGAC. 
7901 CATTCCCTCT GGACCATAAA ATTCTCmT TGAAAGAATT CATTCTTGAT 
7951 TQGGCCACAG CACATCrnGT GMfiCfiQCAT TAGACATTrG CCACTGCTCA 
8001 GCAGCTCrOG GQGAAAATGT TTACTCAGAA GGGTACAGTA Gl I M II IGA 
8051 CTAACCATQG TGCAACCTCG TCCCAGAGGG AAACCTATGA GTATTTCAAG 
8101 GACATGTTGAT QGrCTOTTTT TCTCCCCAGT ATCTGACATG ATGGGrAGTG 
8151 TAGAGCAAGA GOTACAGAT AATGGCTAAA TTAAATTTrC I 1 1 I IGAATT 
8201 TTAATATTCA ACM 1 1 lAGG GTACCCAATC TCCATATTTA GGAAAATAAA 
8251 TTAOVTAAAA AGTOGAGAGT TTTTATTCrc AAACTQCACC TCCATATTCC 
8301 CAGTGGreCA GGATCAGGGA GCACAQGrGT TCGTCTGGGG AAGCCAQGGC 

8351 ccrcrciXiGr tctggaqggt: gaqgattaag aqgaagcctt agatagtatt 

8401 TATGAGTATC TQCTGACrrc TCrcrOGGAC CCAAGATCAC TGAACTTTTG 
8451 CCTAI I I \UK GATCATCTTT CCAATCCAGG CACTAACAGC TGAAGGATAG 

8501 Gcrrecccre gagccattgt agtcgttqga tgaagataaa agataaaaaa 

8551 CPGrSAGQQG AGGTCTCACA GAAGAAAGGG CCCATCTGGG CAGATTTTCA 
8601 TTCAATTCCr ACTCTTTATT ACAQCAATTC TCGAGTGCTG CAACCTTAGA 
8651 AAAQGATTOC TACAACACAA TCTAQGTACC CATCAGCAGC AGATTCGATA 
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CACCATCGAA TACTATQCAG CCATAAAAAA . 
GCAGOVATAT GAATGCAGCT GGAAGCCAAT 
GAAACAGAAA AACAAATACT GreTTCTCAT 
TOGCTTAAATG GGGCATAAAG ATCGGAACAA 
AGGGGGGAGG GAGGGAGGAG GGCAAGGGCT 
LI I IGI ICAC AACCTGGGTC ATQQCAQGAT 
TCACACAGTA TACCCTTCTA ACAAGCTGAT 
AATAAAATTA TnTATTTTA AAAAATCATT . 
GGATTCCTAG AO«3GTCCAG CXAAACAATT" 
0GCO\CCQCC AGTOkCTTAT GCTGCAATAG . 
ACCTACTTCr CTCCAAAAGA GAAGCTATAC 
GGTTCrCCCT GGAAGnTCT QGGGAAAGGG . 
ACrCTTCCPG GAlCTGGGAGC CGQQGCrrCT 
GGAACrCOGC GGTCTCCCAG CGO«5CCCAG 
CCGGGTCCrc GGCGCTC3GCG CCTTTGGGCT 
TCrCCTTCCA GAGCnTAAC CJGATGAAGGF , 
GAGGAQGATG C I G I L I l A iGG CXTCTTCCCA 
GAGATCCOGT TOGTCGGTCG CACTTCCACC 
CGCGGAGCre CGAGGGAGAC ATCCTOGATG 
LI II IGGTAC CrOGACTATA ACAAGGATQG 
TTCAGGAAGG CCTGGAGGAT GTAGGGGCCA . 
AAGGTCQGrC TOVCTQGGGC TCTAATCAGA 
CCTX3GAGAGG CATTQGGCAG AGAQGGCAAA 
GACCTGGGCC CACTGCAGTG TTCAGGTCGr . 
TAAGAATAAC AAOVCAGCTA ACACATTTCT 
CrCCTTCGCT GTAGTAAAAT CTCCAACrTC 
GCTACATACA GCLI IGILI I. AlQGAGfXACC 
TCATTAGTCA CCCAGAGGGG CGrCTAGGCT 
TCAGAGAACr GGAATAATG^ CTCTACGTXJr 
TTGGAAATTT TCPGATGTTA TGrnTQGTT . 
GTCGAAGTOG CmTACTa: CGGGTITCAC . 
AATATOGOT AATTGATAGA CCCTAGrTAT . 

cccrTcraT ccccagaagg ctaacctaca 

TQCrTCGTAG ATACTCCTAT TGCAGTATTT 
AATTTATTAT TCTATATAAT AAtTACTTTA 
AIGI I ILACC CGGTAGACTG GGAGATCATC 
TATTTTOGAT AA1TQGTACT TCGTCCCCAA 
GTCATGnGA AATTTAGTAA AACrCTTTCT . 
GAATATTCAA GCATCCATTC AGTCKTCCGV 
TCTATATTCC AAGCCCXATA OCCTOGrATC 
CCTGGACTCT GTACnTCTG TQGACCAATT . 
GCAAAGCTTA TCTGGATTTT TAATTCGTAG 
TTTAAAGTTA AGIGI ILI IG TTCAGGreCA 

CAQCAcrrre ggagqccaag gcaqgtogat 

AGACOVQGCr GGCCAATATC GTAAAACCOC 
AATTAACQQG GTCTGCrGGr GGGnTTSTCT 
GGAGAATCAC TTGAGCCTX5G GAGGCAGAQG 
ATCACTGCAC TCGAACCTX5G GTTGACAGAGT 
AAAAAAAAAA GTTAAGTXnT OTCATATTT . 
TTAGATmOC AAGTOTAAGT TGTAI I \U I 
TTCATAAGAA ATTCrQGGTT AGCTATCAAG . 
TTCTTCGTTA T7GAAACAAA AGGI IIGIAG 
AAQGAOQCrr TCTOTACCTA GPGGTCACrG 
ACCCAAGQGA GGGGTCTCCC CAGAGAATTC 
TTLIGI I ICA GAQGAGCACC ATTCTGACTT 
TCGTTAGGAG GmTGATGT GAGGTCTCTT 
OVGTAQGAAA ATTTGTTTAT ATA6ACAAAA . v 
AAAAAGAAAT GATACTTACA I IGlCJGIGIi 
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11601 AAGATACAAA AGCAATMCT TnTATTCTC AAAATAGTCT GnTTTGAAC 
U651 AATATAircr I ilCil I I 1 1 1 CCTCTGAAAG TTGAGAAACT AAATATAGGA 
11701 AGAGATAATG GTCAGACCAT AAATAAAAAT AGAACTTTCA CTCAAAATTT 
n751 ACAQCAGTCr GCGCAGAAAA CCAGCXXTTT ATCTAAAATA AACAGAOCAG 
1L801 GAAACCAGCC TCTTATSTCA GACTTATAGG AAGTCAGGTT GCfATCTiCrA 
1L851 GAGACAATAC ACAAAGCTAT GCAATAACFG CTGTAACAQC CCCAAATQGT 
1L901 CAGAATTTGA TTAATAACCG AOVGCCCCCC TAAI II II II CTTCACTNNN 
1L951 NNNNNNNNNN ^JNN^I^JN^iNNN NNNNNNNNNN NNNNNNNNNN NNNNNNhnTC 
12001 ACGGCmGCr AGAACTGTCG CCTTCQGTCA TCttATTTAA TQCCTOGAQG 

12051 CCrCAAATUr taactaqcta atcctaagac ctacccagta acttagcata 

12101 AATAGTAAAT TCATTCATTT AAlCilllICA AACAGTCCCA GACAIIGI I I 
12151 AATGAACTGG GGATATAGTC GTCAACAACA CTGACAGGGT TCTTCATTGr 
12201 ATTCTCAAAA CCCrCCCTAT AGTAAGTAQG TCTCTCreTC TCTCTAQGTC 
12251 CAtQQQGAAT AAAAAATAAt AAGCAAATAA TGAACAATAA AATTATTTTA 
12301 TTTAAAAAAA AAGAAATGAT ACTTACATPG TCGTUnAAG ATACA AAAGC 
12351 AATAACmr TATTUTGAAA ATAGrGTGTT TTTGAACAAT ATAI IGI I I I 
12401 Gl 1 1 1 1 IGCr GFGAAAGTPG AGAAACTAAA TATAGGAAGA GATAATOGTC 
12451 A6ACCATAAA TAAAAATAGA ACnTGACrC fiAAATTTACA GCAGTCTGCC 
12501 GAGAAAACCA GCCCRTATe TAAAATAAAC AGACO^GGAA ACCAGCCTGT 
12551 TATGTTCAGAC TTATAGGAAG TCAGGTTCCT ATCTCTAGAG ACAATACACA 
12601 AAGCTATQCA ATAACrGCTG TAACAGCOCC AAATQGTCAG AATrPGATTA 
12651 ATAACC3GACA GCCXXXCTAA I III I II LII CACTTGCAAC TTAQGAGGAA 
12701 CCAGAGAAAG CTAAATATGC ACCACCtACr AATCAAATAG QGTGCCjGCGr 
12751 TTCTAATGAA CCCTCCTACA GCTTCCCCAG GCCAGCAGCC CGCAATO\GG 
12801 AAACGCCPGA AGCCTTCCCT TrTTCrCACT GTAAAGCTTT CCCACTCCTC 
12851 TGCCTXjGCTT ■pGAGTCTCrc TCAATACACA AGrGAGGGre TCrGACTCCC 
12901 TTQCTATAGC AAACTOOGQC CAAGTAGATT TTACmTCT CATTTGAfre 
12951 GTCmTATT TCTAGAAQGA ACATAOW5A AAATTTAAAG GQGAAt<XAT 
BOOl TCCTAATCTT TCATATTATA GTAGTGCCa" TTTATCTGCA GQGCATATTT 
.13051 TCCAAGACCC CCACTGAATA CCTCAAACTG TGQGTAATAT TGAACCGTAT 
13101 ATATACTCTC TCTATATATA CATATATATA TATA 1 1 III I. AAI 1 1 I III I 
13151 TACITTATCr TTAATTAGCr TTAQCrClTT llllilll l I TGAGATQGAG 
13201 TCrCACTCTG TCACCCAQGC TGAGTTGCAGG GGTGCAGrCT TOGITO^CTG 

13251 CAACcrcrcr CTACOGGGrr caagcaattt cnuTGCcrc aacctcoqga 
13301 gtagctgqga ctaonggcgt gtccgaccac ttcctggcta AI IGI 1 1 iaa 

13351 ATTTTAGTAG AAAGQQGATT TCACCAAGnt GGCCAGACTG GrCrCGTACr 
13401 TCTCACCrCA AGTGATCGQC COVCCmOGC CTCCOVAACT GCT3GGATTA 
13451 CAQGCGnnGAG CCACCOTGCG CCCAGCCATA GACTATATAT TTTTGATCTC 
13501 ATAACTGGTT CAGCrACTAA GPGACTAACA QQCAA6TAGC ATCrATAGTC 
13551 TCGATATQCr QGACAAAAQG AOVrTCAGCT CCnaOQCAQG ATGQCAOVGA 
13601 ATGTTCAGAG ATTTTATCAT GCTACTCAGA ATQGTGTCCA ATTTAAAACT 
13651 TATGAGTTGT I IGl I ICIGG AGmTCCAT TTAATAGTTC AGACCATQGA 
13701 TTGACCGCAG GTAACTGAAA CTGTGGAGAG TGAAACTCTG GATAAGQGAG 
13751 GAOATPGrA TTCTTAAGrC AGACTCATTA QGOVATCATA AGTOTGATT 
13801 TCCCATCAGA AATCCTQCAG AAATATOOGT TAAAAAAAAC TGriTCAAAAA 
13851 TAGGGTCAGG GATGrCCTTT AALI IGI lAC TTCCAAAATG TTAGPGAAAA 
13901 CTCTQGCCCC AAAGAGTCAA AGGAACAAAT GACTAAGAGA AAATCI IGI I 
13951 TTCAGGATGA CAGATTAAAA AAGAAGCAAC TTCCPGAAAC ACTGAAAATC 
14001 TCrCCACTTG TAAGATAACA CAAAACTOGC TAAAACTOGT TQGAATGAAT 
14051 ATQQCOVACr ONAGFCTCCA CAGAACTAAC TPOGTCATGr TACAGCCCAA 
14101 ATTTCCACCA CATATTTTAT ACTAACTCCC CCGGGATTTT CACACATCAT 
14151 CrGTGAQGTA GCATCAAGAG GTAACTATGC ATGCCTAAGG ACTTQQGAGA 
14201 CCrCCCCATT TCCTTCCACC AATCACCOXC TAATCCCAGA ATCCGCCCCC 

14251 AAACcrrrrc taataactac cttaaagcca gcataqqgag Acagatttga 

14301 GCTXSGACrcC I GILI I C I lb TQQGTGACCr TQCAATAAAA AGCI 1 1 ILI I 
14351 TTCrCAACAC CTOGTATTAT AGrATTGACT TCrAGTTCAT CGGGCAGCAA 

14401 Gccccmrc GrcoGreAcr attli igi ic gctcatattt ccattqggca 

14451 AAATATAAAC CTCTTAGATG AAACTTCAGr AGGTAAATQG OGCO^CAGAA 
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14501 TGCTGTCACA TmTCraT GGATTATAGC AGGTTACnT ACTGAATACC 
14551 GTAGQCAGTT ATAACAO^CT AAGTATTTCT GTATCTAAAC ATAGAAAAGA 
14601 TACAGTAAAA ATATCCTAAT 1 1 1 1 1 ICAAC TTrTAGtTCA GATTTQGAQG 
14651 GTATCPGOVC Al I IGI lACA AQQGTATATT GOVrGATQCT GAGGTTTCGG 
14701 GTACAATTCA ACCCrcrCAC CCAQGrAGTX3 AGCATAGTAC CCAATOGATA 
14751 ATTTTTCAAG eGTTCrCGAT TCeCTCeCCE TrcrTCTAGr CCCCAGTTTC 
14801 TG CmrC CC A TCTTTAT AT CCGTCTCCAC CCCATCTTTT GCrCCCATGT 
14851 GTATCTGAGA ALI IblbblG TTTOOTTTTC TAI I ICIGGG TPGATTOQCr 
14901 TAQGATAATC GCCnCAGCt GCATCCATCT IQCTQCfiGfiG GAOGTCATTT 
14951 TATTLI I LI I TATSGCTCTXJ TAGTATTCCA TQGTGAAAAA TATAGTACTA 
15001 TAACCTTACr AAATCACTCT CATATATATG GTCTATCATT GACTCAAATG 
15051 TATACACTQC ATGATATATA TATATATATA TCTATAATCT OTATCCATr 
15101 TCGTCTATTA TGAGATTTGA TPGCTAATAT TTTATACAQG Abl I I IGCAT 
15151 CI I II ICACr AGTTCACATT GCrTCTAATT TTCLI II I I I TGTXSATGTCC 
15201 CrcTTAGGTT TTAGAATCAA GrTCTATACCC GCCTCATAAA ATCQGnOGA 
15251 AAATCTTCCC ACGLil ILIG TTCTCTOGAA AATrGGTXJTT I I 1 1 ILI lAA 
15301 AGTTnOGTAG ACATTATTGr TAAAACCATG GQGnXTOGA IIIIICIICA 
15351 TQGAAATCTT TTCAAATTAC ACTTTAAATT tCTTTAAAAT CTCAGTATAG 
15401 GGCTATCAGA CTTTCTGCTG TCn"ATGrCA GM 1 1 lAATA Abl IGI I I 1 1 
15451 GTAGGCGnr GTTATCTCAC TnnGATATTT TTGATATAAA GCTTTTCATA 
15501 ATATCATTAA TCTCTATAGr GTCrAGTAGT TTCCATGTTT ACI I ILIGAC 

15551 ATTCGnATT TGCo^G^^^T AQGAGmAT CAATTTTATT AGrcrnrcA 

15601 AAGAACCATC TTTTCGCnT GTTAATCCTC CCAATGGTXJT Gl I I ICI IIC 
15651 T(XrTACm TraTOTTA TTTCOTOVA LIILI II 1 1 I GCTTAATTTT 
15701 AAAATAATTT aTGAGATTC AGATAAGCCT CAATGATQQG TCACCGATTT 
15751 CCAGTCnrC TTCmTCTA ATTATGCATT TTAAACCAGA AATCTrTCTC 
15801 TAAGTGTAGC TlTAGTraCA QCTCAOAAGT TTCAGATGlS TCTOtAGTC 
15851 TQGAGGTTQG AGATCTCACC ATGACCATGA AACCATCCAG TCAONATCTC 
15901 GCATTATTTT TTTAAI I I I I I I I I I I I I II TTGAGATAGA GnTCACTCT 
15951 TATPGCCTAG GCTGGTGTGC AATQGTGCGA TCTCGGCTCA G^GCAACCTC 
16001 CACCrCCCAG GTTCAACjGGA TTCmrraCC TGAGCCTCCC AAGTAGCrOG 
16051 GATTACAGGC ATGGGCCACC ATGCCCAACT AATTTPGrAT tnTAGTAGA 
16101 GATSQQGGTT CTCCATGrrc GTCAGGmSG TCTPGAACTC CCGACCTCAG 
16151 GTCATCCGCC OVCCTCAGCC TCCO\AAGTG CTX3QGATTAT AQGAATGAGC 
16201 CACTCTCCCX: GGOXTUOT QQCATTAtTT AOXAGAAGA GCATGAGCAT 
16251 GAGAACAGTA GAATTTGrAA GCnTGAGtG QGTGACTATG AGTUTCATAA 
16301 TAGGTAGATA GGrtATATTT TGQGTGGTCG TAGGAGAQGG CTTACAGrTT 
16351 GCTATGAOVG LI I I I lATAT GGATCATCCT TAGTAAAAGA TTATTTAATT 
16401 TTTGAAATCA AAQQQGAAAA CACTAGnTA GGLI I ILI IC I I ILI I ILI I. 
16451 TTTTA6AGAC AQOGTCrnQC TCTGnnCAClCA QCrTAGAATG OVGTGGTCCA 
16501 ATATTOGTCA CTXnAACCTC AAATTCCTCG GCTCAAGTGA TCCTCCTACC 
16551 TCAGCCrCCA AGTAGCTAGr ATTTACAGGC ATGGACCAAC ACATGTGGCT 
16601 AATTTTAAAA ATTTTTTATG GAGATGAQGT CTCACTATGrr TGTCCAGTCr 
16651 QGTCTTGAAT CCTGACCTCA AGTCATCCTC CCCCATCAQC CTCCCAAAGTr 
16701 GCTCONATAT TTTAAATCCT GreGTAGGTC AAGIOil IGI CtTCTATnT 
16751 QQGGTTTATA AAGTACATGr CAAGAAAtlT AGGGTATOGT TAGATTAGCT 
16801 TTAAAAATGT CAIGI I I lAT AAAAATONAT GCATCATTTT TGTGATTCAA 
16851 AATTTAACAC AAGACTCAGA ATLI I I I IGC AGTAGUGGAA TTACnTTAT 
16901 TATAGATOT TGCGATAATG AATGATGATA CATCFQGCCA AAAATAQGTA 
16951 GTATAGrcrr TTAQGAAAAC AGCTAATCre CTTCAAATAT GTOTAGAAAT 
17001 AATTTAGTGC ATCAGCCCAT ATTQGCAATA ACTTCrCTCr AAI I I I I I I I 
17051 TATAGAAAAT TTTTACTACr QGAGATGTCA ACAAAGATQG GAAGCTX5GAT 
17101 JTTGAAGAAT TTATGAAGTA CCTTAAAGAC CATGAGAAGA AAATGAAATT 
17151 GGCATTTAAG AGTTtAGACA AAAATAATGA TQGTXJTCrCT TTLI I I IGIA 
17201 TTTATCACCA GCTATGAAGA AGCATTTATC ATGCTTTCAA GAGTCTAAAA 
17251 GGATQCTTAT TTAATCrcrC TGbl I I lAGA TGATAATTAT TAI I IGIGI I 
17301 AATALI 1 1 1 1 TTTAGTAATG TGAI 1 1 1 lAT GTAGAGnTA TATTATTTAG 
. 17351 TGAAGAAAAC TTATAGATAG LI 1 1 IL I 1 1 1 TGATTACTTT GAAATCTAAT 
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17401 GAATTACATT TCTCAATTAA AAACTCrOGG CAQGQCCTCr TGTAAATCTT 
17451 AACTATGGAA CATTATQCTC ATTnGAGTTA AACCrGTAQG TTAAAAATAA 
17501 TAATTATATT TTCTTCTCCT CraOGTAAAA TGAGATTTCT TTTTATTTGr 
17551 ATAGAAGAAT GACA fa l I b I G TCATCTAAAA TTTAAAAAAC TTTCAGATTA 
17601 TCrreCATCT GTTA Cj I 1 1 1 1 TTQGAAGAAT TAATTTAGAG AAGATATCTC 
17651 TGATCCTCGA AATTAGGGAA AAATACSCATA TAAAOmTA AGTCTCTACC 
17701 TTCnSGTTAA GATTATGACT TCTATATTTC GATTAATAGG TnOGAGnTG 
17751 TCTTAATCre IMICIGIIb CTCTAATOGA GTACXACAGA CrQQGTAATT 
17801 TATGAAGAAA TGAAATTTAT TTCrTATAOT TCTOGAQQCT GQGAAOTTCA 
17851 AAGTTGAGCC GAATCTGGTG AGGGCCTCTT ACTATGTCAT AACATCCTAG 
17901 CAQGCATCAC AGAGCAAATC CACTACCTCA GATCTCrCTT CCTCI ICI lA 
17951 AAAAGCCACT AGTCCCATCA TQGGGGCCCT ACTCTGAAGA CCTTATCrAA 
18001 TrCTAATTGG AAATAGGGTC TTGAAGCCCT CATCACTAGA QGTAACCTTT 
18051 AACAGGAAGA GAGAATTTAT AAAAATTATA ATGCAGCACC AAATCCCTCC 
18101 CTALI IGlCaA ATAGTCAAGG TCATTTCATT TACAGAOTG TTATTAAAGA 
18151 AACAQGTTAA AONAATAGAT TGAGAQGAAA I G I GG I I CAT GTCnGAGATC 
18201 AQCAAACTTT TTTCTCCAGA AGTCCA^ATA ATAAATATTt TAG LI I IG l fa 
18251 GGTCATGTQG TCrCAGlTCT AGCTACTTGr CTCTiGCreCT GTACCTCAAA 
18301 AGCAGCCATS GATAATATCT AAATGAATQG GGATGACTGA TTTCCAATAA 
18351 AAACTTTATT TACAAAGATA GTTAATACAC CnATTPGGC TTGAGGGTTA 
18401 TAGnrOCCA TCCCiCreATT TACAATCAAT ATTAAAGm AATTCAAAGC 
18451 AAGTTCCrrc AAAONAAONA ACrAAAOCT AGATGATTTT GAAGATTATT 
18501 CACATCTCTC ACTCTCAGCC AGGAAGAQCT GAGTTTOGGT TQGAAAGTAG 
18551 TACTATPOGA ACAI I IGI IG CCCATAAGCC TTACAATATA TCCCCCTAAG 
18601 TCTAGCCTTA GrCCAGTCTT CTAGCAAAAC TOVOTTTTCr TTOTCrcrG 
18651 OW^CTTTCA TTCCAACATC GACCCTCTCC AGTTCAGATT GTCTTCCAQG 

18701 tongatpgk: -rcrcrecrec tatqgtaggc agtagctcag agatqgagct 

18751 ACCTTAAGAT CAATTCCCAG ATAATCAGAG GTCAATTATC CCAGPGCATA 
18801 AGTAGTXjrAC ATATCAATTC TTCATTTTAT AAAATTCTAA ATGAACCAGA 
18851 GGCAATAATT AAAGATGAAA TTtTCATQGT ATATITCrAG GAAATCTACA 
18901 CAAIGI I ICC CTAATTTCCC ATGI I IGIGI. ATTTTAAAAC AATGTCGCAT 
18951 TATTGOTTCA TAI I I I lATT TnTAGAClT CCTTAATGCA AAAGATATAC 
19001 AGTTGATCCr CATTATTTOG GGATTCTXnA TTTGCAAATT TGCCTACTCA 
19051 ATAAAATnA TCCCCAAAGT AACCCO\AAA TATATACTCA CAGTACTTTC 
19101 COVQQCATTC ATCGAOVreC ACAGAGCAGT GAAAAACTTG AGI IU.ILAG 
19151 CATCTACATT CCTAGCrAGT AGAATAAQGC AAtACrCTCC CI ICI IGI 11 
19201 CAGCrCTCAT ACTATTAACT AGCAAGTATC CCTTTCAAGG TCTAI I I IGI. 
19251 GCCAGI I III GCAI 1 1 I IGI Al I I I IGI IG GTAATTTCCr TTTTAAAATG 
19301 TTCCCCAAAG GTAGrcCTCA AGTXaCTOTCT AGTUTTCXTA AGTQCAAGAA 
19351 AGCCATAGO^ TCCCnATQG AGAAAATATA TQGGTTCGAt AAGCTTTGCC 
19401 CCAAATTCAA TGTTAGTCAA TCAACAGCAC ACATTAAATG AGGTCCCTTC 
19451 AAACAGAAAC AGACATAAGA CATQGTTATG TATTAATCAG TTGATGAAAG 
19501 TGmGTAATC AGAGGCTCAC AGGAACCTAA CCCPGI I I I I CCTGTAQGAA 
19551 CAATCGTTTC GTATTTGCTA ATTCAGTGTT TGCAATGAAT ATAGAACTTT 
19601 ATQGAAGATG ATIGCTCIGA ATAATCAGAA TTAACCATAT CTCTTTAAGA 
19651 GrOCATTTCr AAAGGAGAAT ATTCA6AAGG GTATTTCCAT AAI IICII lA 
19701 CTAACAGATC CTQCCTCrCA CrCTCaTAC ATQGTCCAGA TTCTCATGCT 
19751 GCrCCTTCCC TCfCCCCAGG AGGATTCTCT CAGAATCCTG TCATCTCCTC 
19801 CAQQGTCCTT TCTCCAAGAA AGTCTATCCr TTCACCACTA ACAGTAATTT 

19851 TQGTcrrccr ciiiiicigg agaagtc^qc tctttatqct Gcrro^QCAC 

19901 CAGACCCrCr GTTACI I IGI I I IGI I ICAT TCI I I 1,1 CAT GTACAGTAGr 
19951 CTTAGGATTC TCATGAGCCT 6TCAGCTQCT AGAAQ6AAAT ACAGCAGTCC 
20001 TTACATTTAT TGCTTCTATT TTATTTTCTA I 1 1 IC I C I I C Ci G ICI ICIG 
20051 Al IGI ICTCC TTCTXJTCCAC AAACATCCTC TAATTTCCCT AGTATTAAAA 
20101 AIIIICIGIC II IIGIIGII. CnTTATCCr TGCTCCCTTA TTTTrACTGC 
20151 CAGAI I I MA TTrTTATTTA TTTAI I I I IG AGATQGAGTC TCACTCTCTC 
20201 ACCCAQGCTC GQGTOCAGTG GCQGGATCTC AGCTCACTCC AACCTCOGCC 
20251 TCCCAGCrrc AAGCAATTTT O CtCI I MA G CCTCCCAAGT AGCTOOGATT 
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20301 ATQQGCACCr GCOVCCATGC CTCQCTCATT TrTCTATTTT TAGTTAGAGAC 
20351 GGGGnrCAC CATGTTQGCC ACACTGCTCr OAACreCTC ACCTCAQOTG 
20401 AACCACCOQC CTCAG CCTCC AAAAGTGCTC QGATTCCAQG TCTCAGTCAC 

20451 TCreccrooc cmrAcroc ovgattttta aaagaatagt ligigliiia 

20501 GCTCTATTrc CrCATTrACr ACTTCrCTTT AACTCAGTCA TATATCATGT 
20551 TTraCATAGT AAATCTCTAG TAATTTATTA AAAATCTAGA AATAGOTACT 
20601 TTTAAAATGA ATAGATCCTA CTTTAATfX3A ATTTATOTG GAGTTAGAAT 
20651 ATCTTGATTT QGATTTTAGT TCrQCrAOT CTTAATTACA TTACTraCTA 
20701 AQGCCACrnG TSAACTCAGT CrCTTTOGAG GAATATTATT TATCTATAAG 
20751 GCTGTTACAA TTAOnGAATT TTAAAAAATG TCTATTTATT TTTTAATCTA 
20801 1 1 IGI lACAT TnTAGTATT GATCTTCQGA TAQGCATTTA AGCAAGKTA 
20851 TAACrCACCr ACATCGOAA TTTTCCCTTA ATCAGTTTAA AGCnTCTCT 
20901 TAAATGAGAG ATTTCAAATT OVTAATTTCr GIUil ICI lA TCAOTTCTGA 
20951 GTTTTATTTT TTCCCCTTTT TATTTTTTTA AAQGAAAAAT TGAQGCTTCA 
21001 GAAATTGTCC AGTCTCTCCA GACACTHOGCT CTXyVCTATTT CTCAACAACA 
21051 AGCAGAGITG ATTOTCAAA GCTAAGCTCT TCATGTraGT OWZAATTGA 

21101 crmovcrrr aatatxxtoc attagaactc igiCjIiigia agfgtoqctt 

21151 TAAMCACCr CCCVAGVCTT CATTATGTAT ATGCAAGATC I I I I IGILI I 
21201 TTTTCCrCCC ATTOVrnTG TATCTCTACA TTTATCTAAA GTXnAAGAAT 
21251 GGGAAGTCTA AGCTCAGACT GGAGTCnTC TTTCAAGGCC TCAAAGGATA 
21301 GTOGMTOGC AQGAAGTAAG GmTAACTC OVTAGATGAG GAGCTGAAGA 
21351 Gl 1 1 IGGIGI TQLIIIIICr CCAmnGATT TCrAATCTCA OVGTAAAACT 
21401 CATPGATTCA AACTAAGAAG ACTAGCAGAT TCATCACATT ATTTAACCTA 
21451 GATGTGACre GAAAAAAQGG AAATTACTAA GCTCTCCAAG CTAACAAAGA 
21501 AATACCTCTT TAAAOTTCA GAAAACAGAA ATQCAAATTT GAACCTTATT 
21551 GTCPOOGGO^ ATCAGTTPGA CTATTTAAGT CAGACTTTTA TACrOTAAT 
21601 GllllbillC ATOGGATAGA GOVGTAATCT CTQCAGCCCA QGrreCTCTOV 
21651 AATACTCrcr TGCTATAAAC ACAQGGCAGG AACreATTTT TTATGATAAC 
21701 GTAAAACAGA AAAQGACAAT TATATTCTAT TAATAI IGI I GTCAATATTT 
21751 TCAGTCCrCA CATTUTCTAA AAATCnTCT AAATGGCTTT GTTATTGAAT 
21801 TTATCTCATT TTATATCTGr GCCAACAQCA TTTTCArCCr TTCTCITC^T 
21851 AAI I lU 1 1 1 ACAAACAGCT GCTCAAGAGG AAQGCTCAAA GTCrCAAQGC 
21901 TGAGCACGTA ATGALI I I IG TTAGTACTAG ATGAGAAQGG OTTCCTCAG 
21951 GAAATGAAAA CCTAAAACAT GAAAAGAAGA TAAACAGAAT TTOGACAGrG 
22001 AGATATAGAG CATATAATAT TLIGCI ICTA AAGTAATATT OTCTAGGAA 
22051 AGTGAQQGCG TTTCCCTOGC TCTTAGGCCA GAAATCATAT TCCTATATTT 
22101 TCnTGATAG CTTTAGGAAT AATGCAAATT CTAAGCCCAA GCTTCAGAAT 
22151 AGACTAAGAA GrATTAGCTT AGCPGCGATG ACAAAATACC ATAGGCR5GA 
22201 TGCATTAAAC AATQGAAATT TAGIillICA O^GGTCrOGG AGCTOGGAAG 
22251 TTTAAGATGA GAtTTQCXZAQC ATOGmROGGT TCTAGnRGAQG GCrCTCTTTC 
22301 TQGCTTGCAG ATAGACCCCT TCTOACTCTA TTCTCATATS GCAGAGAGAG 
22351 AGAGAGAGAG AGAGAGAGAG AGAGAGAQGG GATCnTCTC TTGCnTCTA 
22401 TTATAAGGCC ATAGTCCTXJT TGGATCAQGG TTCCATTCrT ATGAOTTAT 
22451 TTGACnTAC CCCCXTAAGA TGCTATCTXIC AGATATAATC AOVajGRSGG 
22501 TTAQGQCCrc AADMTTCGA TTTCQGAQQG ACACAGCTCA GTCCATAGOV 
22551 AAQGATAATC CAGAGQGTTG GATATTTAAA AGT/^XTACA CAAI I II lAA 
22601 TATAAATATT TTATCGrAAC I II II II I II TnTGAGATTG GAGTCTAGCr 
22651 CIGI IGCCCA QGCnSGAGCG CAATGGTGCG ATCTCAGCTC ACTGCAACCT 
22701 CCfiCCrCCO^ GGrrCAAGCA ATTCrCCreC CrOVGCCrCC TGAGTAGrre 
22751 QGACTATAQG CACQOQCOVC CACQCCTOQC T A I 1 1 1 I I M TTATTTTTAC 
22801 TAGAGAGGGG TmOCACCAT ATPOGTCAGG CTTCrCTCGA ACTCCTGACA 
22851 TCAGGPGATC CACCCATCTT GGCCTCCCAA AGTGC^GGGA^ TTACAGAAGT 
22901 GAGCOACCGC GCCTAGCCAG CAGCTTTACT GAGATCTAAT TCACATQCXA 
22951 TAAATTCACr TTTCrAAAGT ATACAATTCA GTCACTTAAA ACATTTATTT 
23001 Al I II lAAAT TGACAGAATT ACATCTATTT ATCATGTACA ACATGATCTT 
23051 TTCAAGTATA TCTACATTCT GGAGTGACTA AGTCTAGCTA ATTAACATGA 
23101 TACATCrCAT AOTAATGAT TTCTCTOGITG AGAACACTTT ACATCCATTC 
23151 TCTTAGTATT TTTCAAGAAt ATAATATATT ATTATTAATT GTfiGTCTTCA 
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23201 TOTTCTATAG TGGAGCTCTT GAACTTATTC CTOVTCTCAA GCTGAAATTG 
23251 TCTCrCCTTT AAOVCAAACC ATACCCGACT CCCAAAOTAT TCreCTCTCr 
23301 GCTTCTATCA GATTAACTTT TTCTGATTCC ACATGAGTGA GATCATQCAG 
23351 TATTTATTTC TCTTTACrre GGTATTTOV TTCATATTCr TACAGATAAC 

23401 AQGAnrccr t l i j i i i i ia atggccgmt agttttctat tgtatatcta 

23451 TAGCACATTT TCrCrCTTCA TGCATTGGTC GACACTTAGG TreATTCCGT 
23501 ATCTTOGCTA TCCTGAATAG TGCTATAATG AACATQQGAA TGCACATGGC 
23551 TCnTGACAT ATTCATTTCA nTTATATAT GTUTATATAT ATATCTATAC 
23601 ACACAGATAC ATACAGTOGT QQGATPQCAG GATCATATOG TAGTTCTATA 
23651 TTTAAI I I 1 1 AAAQGAACrC CATACTQCTT TCOVTAATQG CTCTATTAGT 
23701 TTAACrCCrc ACCAACAQGG TGCAAAAGTT CCCTTTTCrC TAOXTAOTG 
23751 CO\ACACTTC TTATLI 1 1 lb TCTCnTQGT AATAGTCATT CTAAGPGrAG 
23801 TATGAQGrak TATCTCATTC TCGCTTTTAT TTCCATTTO' GraJTAATTA 
23851 GTGATATOGA GLI 1 1 I I 1 1 1 1 1 I I I IGTAC TTTCGCCATT TUTATOTCTT 
23901 TGAAAAATGT CTATPGGGGr TTTTTCGrre TTTATmSAG GnTTTWNN 

23951 mmmm mmmm mmtmm-mumtm nnnnnnnnnn 

24001 NNNhMNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NhO^NNNNNNN 
24051 NNNNNNNNNN NNNhO^NNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24201 NNNNNNNNNN NhD^NNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
24251 NNNNNNNCGG QQGTTCCCXn: Om-CTCGCT GCCnCAGCCT CCCCXyWJTA 
24301 GCTXjGGACTA CCAQGGCACG CGCCCACCAC QGCCajQGCr AAI I 1 1 I lU 
24351 ATGTTGAGTA GAGACGQQGT TTCACTXJTUT TAGCCAGGAT GGTCTTGATC 
24401 TCCPGGCCrC GPGATCTX5CC CGCCTCGGCC TCCCAGAGPG CTAGGATTAC 
24451 AQGC3GrGAGC CACGGCQCCT QGCCTCATTT CTAbI 1 1 1 1 1 ATTATTCTOG 
24501 TGQGAAAAGA AACTTCATAT GATTTCATTC TGCTTAAATT TCTTAAGACT 
24551 TGI I I IGIGG CCTAACATAT GATATCCCCT GGreCATGTT CCATGTGCAG 
24601 TTGAGAAGAA TGTCrATTCT OTGCCATTA GGTCAAATGT TTTATGTCTC 
24651 ATCrcrCCAT TTGTTCTAGA GTATAGnTA AGTCTGATGr TTCTTACrGA 
24701 I I I ICIGI IG AGATGATTTG TCTATTGCTG AAQGTAGQGT GrPGAAGTCC 
24751 CCTACTATre aUTATTCCA GTCraOCT CCTTTCAGAC GTATTAATGG 
24801 II II lATTTT ATTTTATTTG I IGI IGI IGI IGI IGI IGI I Gl IGI I 1 1 IG 
24851 AGACQGAGTC TCACTCTXn'C ACCAQGCTCG AGreCACTQG CAGGGTCrCG 
24901 GCTCACreCA GCCCXZaJTCT CAC3QGTTCAA GCGATPCTXX: TQOCTCfiGQC 
24951 TCCCGAGTCG CTQQGACrAC AGGCGCATCC CACCACGCCC AGCTAATTTT 
25001 TGTAI I I I IA GTAAAGAOjG GGTTTCACCA TGTTGGCCAG GATCGTCrTG 
25051 ATCTCTTCAC TTCATGATCC ACCCGCCTTC GCCTCCCAAA GTCCTX3GGAT 
25101 TACAQGrere AGCCAOCACC CCPQGCCAAT GTTTQGrATT TATOTTAGG 
25151 TGCTCTGATC TPQCjOTTCAT ATATATTTAT AAAAAACAAT AGCTACATAA 
25201 CTTATTAAGG GATATCGVAT ATAAAATATA TAAATTCTGA CACTCAAAAT 
25251 TTAAAATGGG AGGAGTGGAG TAAAAGTACC TTCATATAAC TTACTATTAT 
25301 ATCCrCITAT TGAATTGACG CTTTTATCAT TATATAQGAA LI I IGI I ICT 
25351 CCTTTAO^AC TTCPGACTTA AAGI I IGI I I TATATCATAT AAGTAAACTT 

25401 Acrccrecrc tgliiiggm tligiiioca tqgaatatct Tnrcovrrc 

25451 OTCACCATC AGTCTUTCrG TAI I I I lACA GATGAAATGA GTCTGrCATG 
25501 GGCAGCATAT AGrPGGATCT AGI I I I I I IA ATCCACTCAG ACACTXjIGI I 
25551 TTTTGATTOG ATAATTTAAT CGATTCATGT TCAAGGTAAT TATTGATAAG 
25601 TAAQGACTTT GTACTACXAT 1 1 IGLI lATT GnTOVTOGT TCTTTTATAG 
25651 ATCCTTTATT CI 1 1 ICI ICC TCTCI IGCIG TCI 1 1 1 1 M I G I GGI l A AGT 
25701 GATTTTCrCr AGraGTATCT TTnGATTTCT TGLI 1 1 1 lAT I I 1 1 IgIGIA 
25751 TCrCCrATPG GI III IGG I I iGlGGI lACC AAGAQGTTAC AAAAAACATC 
25801 TTAAGAGTTA TAATACTTTA TnTAAOTG AtAACnAAT TTTTATTCCA 
25851 AAAACCCCCC AAAACAAAAA AATCTACACT TTTACTTAAT CCCCTTGAAAT 
25901 TTTCAATTTT TGATCTCACA GTTTACCTCT TTTCATATTG TCTATCCCTT 
25951 AAATTATPSr AGCTATTATT ACTTTTAATA GmTCTCn" TCCTACTACA 
26001 GATOT AAGre ATTTCCATAC CATCATTACA GrATTATTTT GAATTTACCr 
26051 GTOTACTTTT TTTTATCAQC CAGmTATA CntCAGATG M I HG I G I I 
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26101 ACrCATTAGC ATCTTTTTCT TTCAGCmSA GGAGCTCCTT TTACGnTCT 
26151 TATAAAATAG GTCOiGTCAT GATTATCTCC CTCAGCTATT GTTTCTCTXjG 
26201 GAAAGTATCr CrCCFTCATr TCTGAAQGAC ALI I IGLIGG GTACATTACC 
26251 C I I Cib i I G GT A I II I I L I C C TTCAAGQCTT TAAATATATC ATCCXnTCr 
26301 CTCOGACCr GrTAGGTCrc TQCTTGACOVS TGTOTTTCCA ACCATATTQG 
26351 GALIGILI IA TATCnATTT GCTTOTATC tfTTCCTXnT TTCAGGATCC 
26401 TCrCATrcrc TTPGAI mm GATAGTTTCA TTCTAATATG TCmSGGCTA 
26451 GTLI Ibl I IG GATTCAATCr GATTAGAGAC CTTCGACTTT TCGrOCATGT 
26501 AGATATTTAC CTLI I ICICC AQCnTQOVA AAM I ICIGI TACIGI I ICI 
26551 TTAATTAAGC TTTTTACCCC TTTTATCTTC CmTCrCCT TCTTCAACTC 
26601 CTCTCACrCA AAAdTTCCT CI M IGATQC TCTTCXATAA ATCTT GTAAG 
26651 CI MC I! C AT TCAmTCAr ICII 1 1 MCI OCrCrGTUTA TTTTOVAATA 
26701 AC C I G I C I 1 1 GAGTTCATAG II ICM iCIl CI ICI IGATC ACI IC IGCAG 

26751 rnGATGcrcc o^TAmscAr tttaai i i ig ircATrerAT rnrcAGCCC 

26801 CATGATTTCr GTrPGATTTT TTCmTATT ATTTCATCTC TTTATTACCT 

26851 TTcrcnrcr OGrcAcroGT tattttccta atttcattga aiigiiicii. 

26901 TGTATTTrcr TCAAGITTGC TGAG CI I I C I TTCAATTCTA TCTX^^ffmCA 
26951 TACATCTXTC 1 1 ICI I lAGG GATQCTClGCr GGTACTTTAT 1 1 IGI I ICI I 
27001 TAGTXSGTCTC Al I IGI ICCT GATTGI IGI I GAIGI I IGIG GCCI IGIGI I 
27051 TAC^TCnmS CATmSAAGA AGTAGGCACT TATTTCAGrC TTTGCAGACT 
27101 GGCMIGICr GAGAATGCCr TTOVACAGTC AGCXnUTCTA GAGATTCTTT 
27151 AATATFTAAT TAAATATCTT TAATATTTFG AAGAACTTCC AAAI IGI I IC 
27201 TAAAGTlQGCr GCACCATTTT ATAATCCCA6 CAGCAATGAA TCA^QGTTTC 
27251 ACTTTCrCCA TAGCTATATC AATACTCATT ACIGICTGTC TTTTCATTTT 
27301 TTGATTTTTA 1111111111. GAGAAAQGGT CmSCrCTGr CATCCCATGT 
27351 GGAGTOOWr GQCWIAATCA TOGCTUVrrc OVQCGTCMC TTCCCrOQCT 
27401 O^ATTCATCG TCTXACXnXX TCAGTACCTC QGACTACAQG CATTG TACCA 
27451 CAATCCCTOG CTAAI III I A TAM I I I IGI AGAGATGTQG TTTTGCCATG 
27501 TTCCCIGGrG TATTAGTCCA TTCTCATQCr GGTATAAAGA ACTGCCreAG 
27551 AGreOOTAAT TTATAAAQGA AAGAQGITTA ATTCACTCAC TTTTCCTraG 
27601 CTCAQGAGCC CTCAQGAAAC TTACAATCAT GGTCGAAGGG GAAGCAAACA 
27651 CGTCCI ICI I CACATGATQG CAQGAAGAGC AGreCCTAGC AAAGAGQGAA 
27701 AAAAACCCTT ATAAAATAAT CAGATCTCAT GAGAAGnTTAC TCACTATCAT 
27751 GAGAACATCA GAATCAQQGT AQCCrCCTCC ATGATTCAAT TACCTCCCAC 
27801 TQQGTCCCrc AOGTCAGMG TQQQGATTAT TG GAACTAT A ATTOWVATG 
27851 AGATTPQGGr GAGGAOVCAG CCAAACCATA TCAI 1 1 IIGC CCTXsGTCCCr 
27901 CCCAAATCCC ATGTTCTCAC ATTGCAAAAC AGSATAATCC CTTTCGAGCA 
27951 GTCCCCCAGC GTCTTAACTC ATTCCAGCGr TAACCTAAAA GrCCAAGGTT 
28001 TOVrCAGAGA O^AOGCAAGT CCCMCIGCC TATAAGCCTC TAAAATOVVA 
28051 AGOVAQGTAG TTATTATACT TCCTAGATAC AATGAQQOTA CAQGCATTGA 
28101 TTAAATATAC I IGI ICCAAA TGGGAGAAAT TOiXOVWVT GAAGQQGCTA 
28151 CAGGCCCCAA GTAAOTCCGA AATCTAGreG AATAGTCAAA TCTTAAAGCT 
28201 CCAAAATGAT CrCCTTTGAC TCCACATCAC ACATCCAGCr CATC CTAAT G 
28251 CAAGAAGTQG GCTOCCATQG CCrTGOGCAT Cre^CTCCT OTOQCTTTTC 
28301 AQQGTAO\GA COXCI ICIG GCFCI 1 1 ICA OVGGCFGGGG TTGAGTCTCr 
28351 GTCGCmrC OVjGTXSIATG gtgcaagctg toqctggatc tactattcpg 
28401 GGTACTQGAG GATQOTQGCC CrCTTTTCAC AGCTCCACTA QGCAGTCGTC 
28451 CAGTGQQGAC TCTTGTGnGAA QGGTCOVACC CCACATTTCC CI ICIGCACT 
28501 GCCCTAQC3QG AGGTTCTCCr CAAGQGCTCC ACCCCTGCAG CAAACI ICIG 
28551 TGTQGACATC CAGGOmTC CATACATCCT CPGAAATCTA GQOVGAQGAT 
28601 CrCAAACCTT AATTCTTATC TTCTCTGTAC CCGCAGACTC AACACCTTGT 
28651 GGAAGCraX AGGQCrraOG QOKXACCT TCTCAAGCCA TQGCCTCAGC 
28701 TCTACXTPOG CrGGTTTAG CCATOGCraG GATQOVQQQC ACCAAGTCCT 
28751 GAGACreCAC AAAGO\GCAA GGCCCTOGGC CTQGCCCAQG AAACCATTTT 
28801 TTCCrCCTOG GCCTCTQGQC CTATGATQGG AGGQCCCTTC CTGAAGACCT 
28851 CrcAAGTGCC CTOGAQGOVr TTTCCCCATT GTCTTAGTXSA T TAACAT TTC 
28901 ACrCCIIGII TCTTATQCAG Al I ICIGCAG CreGCTTGAA Mill ICCTC 
28951 AGAAAATAGA I M MC I M l CTCHPCACArC ATOVQQGPGC AAATTTCACA 
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29001 AACmTCTC CTCTQCTTCC TCTX3GAATGC TmOCCACrT AGAAATTTCT 
29051 TCreCCTCAT ACCCCAAATC ATCTCTCTTA QGTTO\AAGT TCCAOVGATC 
29101 TCTAGGGCAG QQGCAAAAAG CCACCAGTCT CTTTCCrATA GCATAAO^AG 
29151 AGTCATCnT GCTOCAGmC CCAAOVAGTT CCTCATCTCC ATCPGAGATC 
29201. ATCrCAGCCr QGACTTCAtT GCCCATATTA CTCTCAGCAT TrPQOTOWV 
29251 GCAATTCAAC AAGTCTCTGG GAACTTACM ACftTCCCAC CTCI 1 1 l lGl 
29301 GTCTGAGCT CTCCAAATTT TTAAGAAGIT CCAAACTTTC CCAGTCTtCr 
29351 TCTCAACCrr (XTAACTOTT CCAACCTCTC CCTCnACCC AGTrCCAAAG 
29401 TCAGrrcCAT ATTTTTOGGT ATCCTTATAG TAQCACXjCAA CTCCTAGTAC 
29451 CAATTTACTC TATTAGTTCA TTCTCACGCr GCTATAAAGA ACCACORSAG 
29501 AATQQCTATT TTATAAAQGA AAGAGGTTTA ATTCACTCAC AGTTTGQCGT 
29551 GGCreOQGAG GCCTCAGATA ACTTACAGCC ATAGCAGAAA QQGAAGCAAA 
29601 CATCTCCTTC ACATCGPQGC AQGA^GAAGA AGrOCTCAGC AAAGAQGGAA 
29651 AAQCCCTATA AAACCATCAT ATCTCXJTXSAG AACTCACTCA CTATCATCAG 
29701 AACAGCAGCA TOGGGITGAC CACCCCCCAT AATTCAATTA CCTCCOkGOV 
29751 GCTCTCrCCC GTCACACATG GAAATTATCG GAACTACAAC TCAAGATGAG 
29801 ATTTOQGTX3G GGACACAGCC AAACCATATC ATCrAQQCTC GTATCGAAAT 
29851 CCTQGGCTCA AGCAATCCAC CCACOTGCC CTACOWVCT GCTCQGATTA 
29901 CAQGCATGAG CCACCATATC TGAACTCTCT TTreATTTCT TTTGATTTTA 
29951 ACCATCGATT Gl I I GTCTAGATAA CCCrGACTAA TATATAATPG 

30001 GTATGAAGTC ATATCTCATC GCTTTCATrT ATATTTCTTT CATGOCTAGT 
30051 GACI 1 1 1 I I I GTACnrnOG GATAI IGMA TTATTATTAT TATTATTACr 
30101 AGIGI I lATA CI ICI ICAGT AAAAGTUTTA GAAACAATTT TTAAAQGCAG 
30151 AATCTX5ACGA GAGTTTCCTG TAGTTATATA ACCATCATQG ACCTTCCCrC 
30201 AAGTCCTAAG CCATTAGTGT TACrCATCTC ACTCCAAATG TCAGLI IGI I 
30251 TTCTTCCATT TCACTOTCTC M IGIGIGCC AAAOTGAAT TCATQQGAAA 
30301 AACATCTCAA TGblGLI lAA TAtOGTmOG ATATTTCTCC CCTCOVAATC 
30351 TGATCTPGAA ATATCACCTC CAGPGnGGA AGTAGGGACT ACmOGGrOV 
30401 CGAGAGTOGA TCCTTCATTA ATQGCnGGT AATAAGTGAA CTCTATrAGT 
30451 TCATGAAAGC TGGTTGTTGA TAAGAGCCPG GCATCTCATT TCrCTTCTCC 
30501 TTCTCrCACC ATCTCACACA aTGCTCACC II III ICIIC AGCCATGAGT 
30551 AAAAGCTTCC TGAGGTCrCA CCAGAAACTG AGCAGATGTT GGTGCCATGC 
30601 TTCTACAGrC TGrTAGAACTG TGAGCCAAAT AAGCCTCnT TCTTTATAAA 
30651 TTACC3GAGrc TOVOGTCTTC GTTTAAAACA ACACAAAACA GACTAACACA 
30701 GrcrPGATTG AAACAGCTCTr GACPOOGltA TCAQQGTCTA AGAGAQGAGT 
30751 CACTCAGmS AAATATAGCC TCCTACTTAC ACCTCTTCAG TAGAAGCTGr 
30801 AGATATGAAG TAGCTGAAGC AGGCATTCCC TCTGAAACAT GTCTmCACA 
30851 TATCTCATAA TTATCI ICIG CTCTCATnT TCmTAGGC TTTTCTCTCC 
30901 ATCrCATTTC CCCIbl I lAC TCTCATTTTC ATATCTTTAC Al I ICI I ICT 
30951 CJCAGAATTGT TCAGAAGOT GGAACCOTC ACTCOVOTTA TTOTTCACr 
31001 ATGCAATTTC TTTCrcTGCT TCATQGCACT TATOGITTCr AATCCTTCAC 
31051 I Ibl I IGTAT AGCrCAGTCG TTAGGAGTAC AGlTTQGAGr TAGAATQCCT 
31101 GGGmSAAAC TOTAATTCT ACTCrACTTA CTAGrCTTCr GACTATAACA 
31151 AAATrCITAG CCrCrCnTG TCreTAAAAT GGAGAGTATA GTAAATACAT 
31201 GGGCI IGI 1 1. TAAQGATTAA ATGAGTTAAC ATCTCAAATA CTTAGAAOVA 
31251 TGCCTGGCAA ATCCTCAATG AATATTGAGT Al IU.I IGCT I I IGI I lAGT 
31301 GCCATGCCTG TTCTTCCCAC TGAGGGCACA GACOVTGTCT ATCTGGTTAA 
31351 CAGTTCTATG TCCACCACGT TGCAATAATG GACTCTCAGA AAATATTGAA 
31401 GAATATCTTA AAGAATGAGT AGAATTAIGC TACTGAAAAG GGrGAGTOGA 
31451 AGGTAQGTAG QGGAAAGGAC ATATACAQCC CTQGAQQCAG CATATATGGG 
31501 GAATCGGTCA CACAGIGI i I CrraGTACTC TCTAGACCAT AGTGQGCCAC 
31551 CrOTAGCTA GTOQCCTATG GAtTATTTCA GCAGrCTCTT QGAAACATCC 
31601 ATGAATATGA TAATAATGAC OZAI IIGIGG GTTCTAAGAA AAAGGACAAC 
31651 TACAATACTA GACAATAATA GTATCTAACT TAQGAQG6AA GQQGATCATT 
31701 TGTATTAAAC TGnTCTAAAA TTCTTACOT ATTTAGGATG ATCGQGTCAG 
31751 ACATTAACTT TAGACI I IGI TATATATATG TQGTAAAATT TCAAGGTAAA 
31801 COVTTGAAAC TCTAGTAGIT GAGTATATAA CTTCCAAATC AQQQQQGftAA 
31851 GAAATQGAAT AAGAAAATAA ATAOVTAAAC ATAAGATPSA AACAATGDVA 
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31901 TGAA6AGTAG AGAGAAGAQG GAAAAACATA GAAAGAATCA GATAATTAGA 
31951 AAGCAATAQG TAAGATGTCA GAAATAAATT CAAGTACAGT AAAACTCCAC 
32001 TAAAATCTCC CCTQCAGTAA TCTraOOQCA TGATTTCCCr TCATCCCCAT 
32051 TCrOVAATQG QQCAGCCTAA ATAQCXnTCT TATCLIGI 1 1 OCCraQOGGT 
32101 TreAQGreGG "PSAClGAGrAA GTTAGAAGAT AATCACCTTC TCATO\GTTA 

32151 GGAcrircrc AGtrrAGrcr tcAArfAAtA aMattaatc TAMtntAt 

32201 CAGAAQGCAG AGATTTJTCAG A7GAAAGAAC AAGCAAAATA AAAGTOTAC 
32251 TGAAAAAAAG CroOGGTAGC TATCTTAATA TCAACTCrTA ATTATTATTA 
32301 ATAATCTATT AATAATAGAT TATATAGTAA AAACATTAAT AAAAATAGAG 
32351 TGTCACTAOV TTTTAAAATT CAGTATGAQG ATATACAATT TTrAAGCTGG 
32401 TTGATAAAAT TCTOGGGATT AATTQGCAAA TCCATGVTAG TQGTGAGAGA 
32451 TTTTAACACA ATTCrrcCTG TATTTGATAG GTCAAGOVGA GAAAAACTTT 
32501 AGTCAAGACA AAAAdTCTA AATACATAAG CTTCATTTAA TOGGOVTCTA 
32551 ATAQGACCTA GOVTDWVAA ATTAGAAAAA ATA! II IMC TTAQGrAtTT 
32601 ATQGAACATG TATAAAAATT GATTTC3GTAG TAQGCCATAA AGCCAGGTTC 
32651 AAGACATTTC AAAGAACTT3G TATCACAAGA ALIU.I I ICT CTGACOOA 
32701 TCCATTAAAA TAGAAGTTAA TTACAGACAT AAATTATAAA AATQCXAATA 
32751 TTTTAAAGTC TGATATACAC TTCTCAAGT ATQGGTCAAA QGAAATCGTA 
32801 AGTOGAAATT CAAGGACACG TTGACTTGAA AAOVTTAAAA: CTTATQGAAT 
32851 ATrrCTAAGA TGGAACTTGr ATGAATDTA TAGTCPGAAA GCTTTTATTA 
32901 GAAAAGAATT. AAGTCTGAAA ATTAATGTGC TAAGrTAQGG GAGAGAAAAT 
32951 GGAATAATCF OGAAGAAQGT AGGAQGAAQG AGATAATAAA GAATATATAG 
33001 CAAAGATCCA GTAACAQGAT CAACAAAGCC AGAAACRJTT GGAAAAGikCA 
33051 AGCCrCreGA AAGATPGATG AAGAAAAAAG AGAAATGAGA TCTAAATAAA 
33101 TCATCTTCAG TTATAAATAG GCACATAAGG ACTTTTAAAA AACTAATAAA 
33151 ATAATATGAA TCATTAATGC CAATAAATTT GAAAAONGAC AAAGTAQGTC 
33201 AATTTCTAGA AAAATATAAC TTAGTCGGAC TCAATGAAGA AGCAAC^iGCr 
33251 TATAGTACCT AAQO(\ATTGA AGAGATTQQG TO^AATTT AAAATTTTCT 
33301 CATAAACAAA ACGTTAGGCC CAGATXXJTTC TPGOWVPGA TTAAAGAACA 
33351 GATGTACAAA CATTTCCAGA GTCTAGAAGT AOVCTGTCCr ATCCTTTCTA 
33401 QGAGATCATT ATAACACCAA AAQCAGACAG TATATGAAAC AGGGAAATTA 
33451 GAQGCCAAGA TACCTATGAC TTATATCrAA AAATTTAAAG AAAATATTAG 
33501 CAAACPGAAT CAGCCATTTT AAAAAATATA CCACAATCAA TCCATTCATA 
33551 AGAGCAGCrr AACAAAATTT GTTAGAAQGC AtTAAAGAAG ACTCAGTATA 
33601 GAAAAGATCT ACCrrciOt OVAATPOGre atagagattc aatgccatta 
33651 AAAAAACCCA CCTOGITTTT TTGAQGAACT TGrCAAGCTG AGTCTCA^AT 
33701 TTATATCAAA GAGCAAAGGC CTAAGAATAT CCAGGACATT CCTGAAGAAC 
33751 TGTAAQGAGC CAGQGGCCTG CCCTATCAGA TACCAAGGGT TGTTATTAAG 
33801 CCATAACCAA GrOVGTGCrG "TTTCrACAGA AACAGACAAG TTAACAAErTG 
33851 AAACATAATA GAGAQCG0«3 AAACAGACCC ATGOVTATTT TQGATTTCTC 
33901 ACXJTGAAAGA AGTAGCTTTC CAAAACmG GGAAAAGGAG AGTXjnGTGCA 
33951 ATAGATGATG CrCGTGCrCA TQCAGACAAA AAGGAAATTG QGATACCTGG 
34001 CrCTTACCGT ACACAAACAC CAACCTAAAC GrGAAAGTTA AACTATAACA 
34051 G CrnGA GGTG GTXiQQGAAGA AATATCTTTA TCTDVGTCTA QGGAAGAATT 
34101 TATTTTAAAA AGAAGACACA AAAQGCCATA CATAQGAATG AAAAGATTCA 
34151 ATTCAGCTGC ATTAAAAAGA TTAAATTCAG CTX5CGTTAAA ATCAAGAGCA 
34201 TCTCTACrnG GACAGCATAG AGTGGAAAGA CAAAGAGAAG GTATTTGCCA 
34251 GCTTATAACr TGAAGGATTA GAATGAATCA TATAAAGAAC TATGTAAATA 
34301 AGAAAAAGAC ATACMCGGG TTAGAAAAAC GGGCAAAGAC ATX^AACAGCA 
34351 TATTTCACGT GAAQGAAACA GGQGTAGO^ ATGAACATQG TAA6AGATCC 
34401 TCAACAOGTT TAGTAATTTC AAQGGAAATG CAAGTTATAC CCACAGCAAG 
34451 ACTATCTTAT CTAGGAAGTT TGTCAATACC CTAAATCTTC TCreGmTA 
34501 AGCTACAGAG TTTCTAATTC ATTTATTTAT TCAATAAATA CTCAOTOGCA 
34551 GGCACIGI I I TAGAAACCTT GGTTATAACr TTGAATCAAA TTAAAAAAAA 
34601 TCOTGCCTT GrOGAQGATG CTTATGrGTG GGGAGTrOGG TOGTOGGGrrC 
34651 AAACAAOl^T TACATTAAAA TAGAAAATAG TGAGVTAAAT AAACCTATAA 
34701 ATATPGCAAC CCAGAGTTAT ATTATAAATG TAACTAGTGA CTAQGACTCT 
34751 CATQCAGATA TACOnCTUTC GHOGGACAAA TGAAAGTTTA AGTOTAATTT 
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34801 CCCATATSCA AGTCAAAATA AAAAGTCACA CTAGAAAACA CAATAATGAA 
34851 TATCTCAAAA TTCCAmTA nTCACreCC ATCCI I I IGC ATCATTTTCA 
34901 TACTAATTAT AGAATAAAAT TTCTAQGATG OWXAAAGCT I I 1 1 I lAGAG 
34951 ACATCCATTA ATTCAATAAA TAAATGAQCA CCIILIIICil GCCAQCAGCr 
35001 GTAAGAGGTC GCCCAAQGAA GQGAATAAAA CAOTCAAAAT CCTX5GTACAC 
35051 TCAGACTTTC TOTAQGAGA AAACAGATAC AAATQGCATT AATTACCAfiG 
35101 AAACTTCTAA AACAAGCCAA ATATTAATCA TAAATATTTG AGTACAGTAT 
35151 CTTAATnTA AGATTCAAAA TGAQCTGCCA GGATTTCTTA AGACTCAAAG 
35201 GGGAAGATQG CTCAATAGGA ACAGCTCTOG TCTACAGCTC CCAGC3GTCAG 
35251 GGACjGCAGAA GACX5CATGAT TSCTGCATTT CCATCTCAGG TACGQQGTTC 
35301 ATCrCACTAG QGAGTGCCAG ACAGTX3GQGG CAGGTCAGTC QGTGTGTCCA 
35351 CCGreOQCGA GCTCAAQCAG GGGGAGQCAT TQCCTO^Crc GQQWjTGCA 
35401 AQGGCrCAQG GAGTrCCCTT TCCTAGTCAA AGAAAGQQGT GAO\GA7T5GC 
35451 ACCTQGAAAA TCGOGTCACT CCOVCCTGAA TACTXXAan TTCTGAGGQG 
35501 CTTAAAAAAT GGCGCACOVG GAGATTATAT CCTCCACCTG GCTGOGAOGG 
35551 TCCTACACOC AaSGAGTCTC GCrGATPQCr AQCACAQOVG TCTGAGATCA 
35601 AACrcCAAGG CXiGGGGGGAG GCTGGGGGAG GGGCACCGGC CATTGCXXAG 
35651 GLI IGCI IAG GTAAACAAAG CAGCCQGGAA GCrCAAACnS QGTGGAGCCC 
35701 ACCACAGCTC AAGGAGGCCT GCCTGCCrCT GTAGGCTCCA CCTGPQQGQG 
35751 CAGGGCACAG ACAAACAAAA AGACAGCAGT AACCTCTGCA GAGTTAAATG 
35801 TCCCTCTCrc ACAGCrmGA AGfiGMXfiGV. QOTTCTCOCA GCAGGCAQCT 
35851 QGAGATCTCA GAAOQQQCAG ACTQCCrCCT CAAGnSGGTC CCTCACCCCr 
35901 GACGCCCGAG CAGCGTAACT GG6AGGCACC CCCOVGCAGG QGCACACTCA 
35951 CACCVCACfiC AGCC3GGTTAC TCCAACAGAG CTGCAGGTCA GGGTCCrGTC 
36001 TGTTAGAAQG AAAACTAACA AAO^GAAAGG ACATCGAOVC OW\AACCCA 

36051 TCTXirAOvrc accatcatca aagacgaaaa gtagataaaa co\caaagat 

36101 GGGGAAAAAA GA6AGO«3AA AAACTOGAAA CTCTAAAAAG CAGAGreCCT 
36151 CrCCTCCTCC AAAQGAAOGC TCTTCCTCAC CAGCAACQGA ACAAAGCPOG 
36201 ATQGAGAATC ACTCTGAGGA GCPGAGAGAA GGCTTGAGAC GATCAAATTA 
36251 CrCTCAGCTA TQGGAQGACA TTOVAACONA AQGONAAGAA GrTCAAAAGT 
36301 TTGAAAAAAA TGtAGAAGAA TCTATAACTA GAATAACCAA TACAGAGAAG 
36351 TCCTTAAAQG AGCTCAT3GA GCTGAAAACC AAQGCTCGAG AACTACATGA 
36401 AGAATGCAGA AGCCTCAGGA GCTGATQCGA TCAAGTTGGAA GAAAGGGTAT 
36451 CAGGGATQGA AGATGAAATG AATGAAATGA AGQGAGAAGG GAAGTTTAGA 
36501 GAAAAAAGAA TAAAAAGAAA GGAGCAAAGC CTGCAAGAAA TATGGGACTA 
36551 TGTGAAAAGA COVAATCTAT OTCrGATnSG TGTACCTTGAA AGTGACQGQG 
36601 AGAATCGAAC CAAGTTQGAA AAGACTCTGC AGGATATTAT CCAGGAGAAC 
36651 TTCCCGAATC TAGCAAQGCA GGCCAACATT CAGATTCAGG AAATACAGAG 
36701 AACGCCACAA AGATACTCCT TGAGAAGAGC AACTCCAAGA CACATAATPG 
36751 TCAGATTCAC OVAAGTPGAA ATGAAQGAAA AAATGTTAAG GGCAGCOW3A 
36801 GAGAAAQGTC GGGTTACCCT CAAATGGAAG CCCATCAGAC TAAGAGCGGA 
36851 TGTCrreGCA GAAACrCTAC AAACCAGAAG AGAGTX3GGGG CCAATATTCA 
36901 ACATTCTTAA AGAAAAGAAT TTTGAACCCA GAATTTCATA TCCAGCCAAA 
36951 CTAAGCTTCA TAAGTGAAGG AGAAATAAAA TGCTTTAOAG ACAAQOWVT 
37001 GCTCAGAGAT TtTCTCAa^ OCAQQCOnGC CCTAAAAGAG TTCCreAAQG 
37051 AAGTGCTTAA CTTCGAAAGG AACAATCAOT ACCAGCGGCT GOAAAATCAT 
37101 GCCAAAATGT AAAGACGGTC GAGACTAQGA AGAAACTQCA TTAACAAAGG 
37151 AGCAAAATAA COWSCTAACA TCATAATGAC AQGATCAAAT TOVCACATAA 
37201 CAATATTAAC TTTAAATGTA AATQGACTAA ATGCTCCAAT TGAAAGACAG 
37251 AGACTOGGAA ATTCGATACA GAGTGAAGAG GGATGAGTGt GGTCTATTAA 
37301 QGAAAGGGAT CrGAGATGTA GAGAGAGAGA TAGGGTGAAA ATAAAAGGAT 
37351 GGAGGAAGAT GTAGGAAGGA AATQGAAAAG AAAAAAAGAG AGGGGTHSCA 
37401 ATCCTAGrcr CTGATAAAAG AGACTTTAAA GCAACAAAGA TCAGAAGAGA 
37451 OXAAGAAGQG GATTAGATAA TQGTAAAQGG ATCAATTGAA GAAGAAGAGG 
37501 TAACTATGCr AAATATATAT GCAGGGAATA CAQGAGGACG CA6ATTGATA 
37551 AAGCAAGTGG TGAGTCAGCT ACAAAGAGAG TTAAACTGGG AGAOVTTAAT 
37601 AATQQGAGAC TTTCAGAGOC CACrerCAAC ATTAGACAGA GGAATGA6AG 
37651 AGAAAGTCAA OVVQGATACC CAG6AATTCA ACTCAOaCT QCACCAAGCA 
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37701 GACCTAATAC ACATCTACAG AACTCTXXAC CCCAAATCAA OVGAATATAG 
37751 A I I IIIII C A GCACCAGVCC ACX5GCTATTC CAAAATPSAC CACATACTTG 
37801 GAACTAAAQC ACTCCrO\CC AAATCTAAAA GAAO«3AAAT TATAGOVAAC 
37851 TATCTCTCAG fiCCACfiGTQC AATONAACTA GAACTQWSGA TTAAGAATCT 
37901 CACrCAAAAC CGCTCAACTA CATCGAAACT GAAGAACCTC CTCCTGAATG 
37951 ACrACnOGGT ACATAAC)GAA ATCAAQGCAG AAATAAAGAC GCTCTTTCAA 
38001 ACCAACAAGA ACAAAGAOAC AACATACCAG AATCTCTX5QG AOGCATTCAA 
38051 AGCAGTCTCT AGAQGGAAAT TTATAGCACT AAATQCCCAC AAGAGAAAGC 
38101 AGGAAAGATC CAAAATTCAC ACCCTAACAT CAOWTTAAA AGAACTAGAA 
38151 AAGCAAGAGC AAACACATTC AAAAGCTAGC AGAAQGCAAG AAATAACTAA 
38201 AATCAGAGCA GAACTGAAGG AAATAGAGAC ACAAAAAACC CTTCAAAAAA 
38251 TTAATGAATC CAQGAGCTCG IIGM IIKiA AAQGATCAAC AAAATPGATA 
38301 GACOGCTAGC AAGACTAATA AAGAAAAAAA GAGAGAAGAA TCAAATAGAC 
38351 ACAATAAAAA ATCATAAAQG QGATATCACC ACCAATCCCA CAGAAATAOV 
38401 AACTACOVrC AGAGAATACT ACAAAOVCCT CTATGCAAAT AAACTAGAAA 
38451 ATCTAGAAGA AATQGATAAA TrCCrC3GACA CATACACCCT CCCAAGACTA 
38501 AACOVQGAAG AAGrTCAATT TCTCAATAGA CCAATAACAG GATCTCAAAT 
38551 TGTQGOAATA ATO\ATAGCr TACCAACCAA AAAGAGTCCA GGACOVGATG 
38601 GATTCACAGC CGAATTCTAC CAGAGGTACA AQGAGGAACT GGTACCAITC 
38651 CTTCPGAAAC TATTCCAATC AATAGAAAAA GAQQGAATCC TCCGTAACTC 
38701 ATTTTATGAG GCCAGCATCA TCCTGATACC AAAGCOVQGC AGAGAOVCAA 
38751 CAAAAAAAGA GAATTTTAGA CCAATATOCT TGATCAACAT TGATGONAAA 
38801 ATCCrCAATA AAATACPSGC AAACPGAATC CAQCAGCACA TCAAAAAGCT 
38851 TATCCACCAT GATOV\GTlGG GCTTOVTCCC TQQGATGCAA GGCrGGITCA 
38901 ATATACGGAA ATCAGTAAAT GTAATGCAGC ATATAAACAG AAGCAAAGAC 
38951 AAAAACCACA TGATTATCTC AATAGATCGA GAAAAAGCCT TTGACAAAAT 
39001 TQVI\CAACAC TTCATGCTAA AAACTTTCAA TAAATTAQGT ATTCATOQGA 
39051 TGrTATCTGAA AATAATAACA GGTATCTATG ACAAACCCAC AGCCAATATC 
39101 ATACTCACTG QGrAAAAACT QGAAQCATTC CCTTTCAAAA CTQGCAOVAG 
39151 AOVGQGATQC CCrcrcTCAC CACTCCTATT CGACATAGTC rnOGAAGlrrC 
39201 TQQCCAQQQC AGTTAGGCAG GAGAAQGAAA TAAAGQGTAT TCAATTAGGA 
39251 AAAGAGGAAG TOVAATTCTC CGTCTTTCCA GAGGACATGA TTCTATATCT 
39301 AGAAAACCCC ATTGTCTCAG CCCAAAATCT CaTAAGCTC ATAAGCAACT 
39351 TCAGGAAAGT CTQVQGATAC AAAATCAATG TACAAAAATC ACAAGCATTC 
39401 TTATACAC)CA GCAACAGACA GAGAGCCAAA TCATGAGTGA ACrCCOGTTC 
39451 ACAATTGCTA CAAAGAGAAT AAAATACOA GGAATCCAAC TTACAAGQGA 
39501 TGTGAAGGAC CTCTTCAAGG AGAAGTCCAA ACCACFGCTT AATGAAATAA 
39551 AAGAGGATAC AAACAAATTQG AAGAACATTC CATGCTCATG GGTAGGAAGA 
39601 ATCAGTATQG TGAAAATQGC CATACTX5CCC AAQQCAATTT ACAGATTCAA 
39651 TCCCATCGCC ATCAAGCTAC OVATCACmr OTCACAGAA TTOGAAAAAA 
39701 CTACTTTAAA GTTCATATQG AACCAAAAAA GAGCCGGCAT TQCCAAGTCA 
39751 ATCCTAAGCC AAAAGAACAA AGGTQGAGGC ATCATQCTAC CTCACTTCAA 
39801 ACTATACTAC AAGGCTACAG TAACCAAACC AGCATQGTAC TQGTACCAAA 
39851 ACAGAGAtAT AGACCAATGG AAOVGAACAG AGGOCTCAGA AATAAOGCCG 
39901 OVCATCTAO^ ACTATCTGAT aTPGACAAA CCTGAGAAAA ACAAGOVATG 
39951 GGGAAAGGAT TCCCTATTTA ATAAATGGTG CTQQGAAAAC TQGCTAGCCA 
40001 TATGTAGAAA GCTCAAACTC GATCCCTTCC TTACACCTTA TACAAAAATC 
40051 AATTCAAGAT QGATTAAAGA CTTAAACGTT AGACCTAAAA CCATAAAACC 
40101 CCTAGAAGAA AACCTAQGOA TTACCATTCA GGACATAGQC ATQGQCAAQG 
40151 ACTTCATUrC TAAAAO\CCA AAAGCAATQG GAACAAAAGC CAAAATTGAC 
40201 AAATQGGATC TAATTAAACT AAAGAGCTTC TQCACAGCAA AAGAAACTAC 
40251 TATCAGAGre AACAGGCAAC CTCOWVATG. QGAGAAAATT TTTGGAACCr 
40301 ACnCATCreA CAAAGGQCTA ATATCCAGAA TCTACAATGA ACPCAAAO^A 
40351 ATTTACAAGA AAAAAAACAA ACAACCCTAT CAAAAAGTCG GrTGAAGGACA 
40401 TGAACAGACA OTCTCGAAA GAAGACATTT ATGCAGCCAA AAAACACATG 
40451 AAAAAATGCr CACCATCACT GGCCATCAGA GAAATQCAAA TCAAAACCAC 
40501 AATGAGATAC CATCTCACAC CAGTTAGAAT GGCAATCATT AAAAAGTOVG 
40551 GAAAONAOVG GTOCrOGAGA GGATtnX3GAG AAATAGGAAC ACnTTACAC 
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40601 TCTTQGTTQQG ACnGTAAACT AGTTCAACCC TTCTOGAAGT CAGIGTOGCA 
40651 ATTCCTCAQG GATGTAGAAC TAGAAATATC ATTTGACCCA GCCATCCCAT 
40701 TACreOGTAT ATAGCCAAAG GACTATAAAT 0\TGCTQCTA TAAAGAO\CA 
40751 TCCACATCrA IGI I IATTCT GGCACTATTC ACAATAQONA AGACrrOGAA 
40801 CCAAGCCAAA TCTOIAACAA TGATAGACTC GATTAAGAAA ATCTOGCACA 
40851 flTACACCAt GGAATACTAT GCAGCCAtM AAGATCAGTT^ (^^ 
40901 GTAQGGACAT QGATGAAATT GGAAATCATC ATTCTCAGTA AACTATOVCA 
40951 AGAACAAAAA ACTAAACACC GCATATTCTC ACTCATAQCT GQGAATTCAA 
41001 CAGTGAGAAC AOVFGGAOVC AGGAAGGGGA ACATOVCACT CTGGGGACTC 
41051 TTCTCGGCTG GGGGGAGGQG GAGGGATQQC ATTGGGAGAT ATACCTAATT; 
41101 CTAGATGACE AGTTAGreQG TQCAGOKAC CAGGWX3CA CATGTATACA 
41151 TATOTAACTA ACCTCCACAT TGTCCACATG TACCCTAAAA CTTAAAGTAT 
41201 AATAATAAAA AAAAAAGACT CAAAQGOVCA GrCACTGACA OTTPGATTTT 
41251 TTATAATAQC TCltAAtTTT CCTAACtTQG AQGAi\GTTCA TAQCATCTTT 
41301 TGAGTATATT TOWVACTAC ATTCAAATCT TGCAATAGAA CATTAAGAAT 
41351 TATCTTCATG ATCCACTAAG TOGflOTGAAAA AAATGGATAA TCAATCrATT 
41401 CATTACXATC GTTTAATATr TTATCTtXIAA Gl M I IblGI 1 1 IblAGCTC 
41451 ATTXaGOVGAG TTPSACAGAG TQCPGAAAGr ATTCnTAGT GAGCTOQCre 
41501 TAATTTTTQG GCCCAI I I 1 1. ATCTAGATAA TTAAAACTAT CTCACAGGAC 
41551 GATAAAATQC TPGCnGCCAT TTCCAACAAC CTATAI I GGATGGGGTT 
41601 TTTTAATnrA ATGAGAATAT TATGTTAGAA AAGAAACTCT CATTCTCTAA 
41651 AGTCGCOVAT AAimAGn!: mTmiOV ATTTAGTTTT GTACnTCAT 
41701 CAI II 1 1 I lA AAATTTCAGC ATTGATGTTG ATQGGACAAT GACAGTOGAC 
41751 TGGAATGAAT QGAGAGACTA LI ICI lATTT AATCCTGTTA CAGACATPGA 
41801 GGAAATTATC CGrTTCTCGA ^AACATTCTAC AGTAAGTCTA CTTTATGrAT 
41851 TTATACTAT TPQGAGCrAT AAACCATAGG TAC(«JTTATC ACCCAAGAAC 
41901 ACrcrGTAAC ACTTATQQGC OVOGATAGCT GAGTCCO^GT AGGTCCTTAA 
41951 CCrGTAGAGr TCTATTTATT CTATTAGGGA TAGATTTATA GAGTATTAAA 
42001 CAAAAAAAAA CAGCTCTCCC TCTCCCrCTC CCTCrCTCTC CCCCTCCCCA 
42051 OGGTCrCCCr CtCGCTGTCr TTCCACGGTC TCCCTCTGAT GCCGAGCCAA 
42101 AGGTQGACrc TACTQCraX ATCTCXSGCTC ACnQOU^CGT CCCTQCCreA 
42151 TTCTtCreCC TOVQCCreCC GAGTCCCreC GATTQCAQGC GCGCACOQCC 
42201 ACGCCTCACJ Gill I ICGTA I 1 1 II I IGGT GGAGACGGGG TTTCGCTATG 
42251 T7X3GCC3GGGC TOGTCTCCAG CTCCreACOG aSAGTGATCC ACCAGCCTOG 
42301 GGCrCCClGAG GreCTOQGAT TGCAGAOQGA GTXnX2GTTCA CTCAGTlQCrC 
42351 AATQGTGCCiC AQGCreGQGT GOaiGTXSGCAT GATCTCXSGCT CGCTAO^CC 
42401 TCO^CCrCCC AGCGGCCTCC CrTOGCCTCC CAAAGPGCCA AGATTQCAGC 
42451 CTCTCGCCAG CCGCCACCCC GTCTOGGAAG TGAGGAGCGT CrCPGCCTTsG 
42501 CGQCCCATC3G TCTOGGATAT GAQGAGCCCC TCTCCCTQGC TGCCGAGTCT 
42551 QGAAAGTCAG GAGTCTCTCr GCCOjGCCGC CATCCTCTCT AQGAAGTCAG 

42601 OGTcrcrecc cqgccdgccca tcgtctggga tgtcaqgagc cccrcTXjccr 

42651 GGCreCCCAG TCTGGAAAGT GAGGAGCGCC TCTTCCCGGC GGCCATCGCA 
42701 TCTAQGAAGT GAGGAGCGTC TCTGCCCGGC CGCCCATCGT CTCAGATCTG 
42751 GGGAGGQCCr CTGCCOOGCC GCCCGGrGTC GGATGFGAQG AGGGCCTCTG 
42801 CrmSCCQCC COGTCTCAGA AGTCAQGAGA CTCTCCXiCCC QGCAGCCX5CC 
42851 CCGTCrOQGA AGTX5AQGAGC GTCTCCGCCC GGCAGCCACC CTCTCaSGGA 
42901 GGGAGGTOGA GGGGTC^GCC GCCCGCCC3QG CCAGCCACCC CATCCQQGAG 
42951 GreAGGGGTC CCrCTCCCOG GCCGCCCCTA CAQGGAAGTG AGGAGCCCCT 
43001 CraCCGGGGC ACGACCCCAT OnGGGAGGTlG TAC<:0\ACAG CTCAITGAGA 
43051 ACXiGGCCATC ATGACAATCG GGbl 1 1 IGIG GAATAGAAAA AGGGGAGAGG 
43101 TQGQGAAAAG ATTGAGAAAT CG6ATQGTTG CTGrGrCTCrr GrTAGAAAGAG 
43151 GTAGACATQG GAGACTTTTC Al 1 1 Ibl ICT GTACTAAGAA AAATTCTTCT 
43201 GCCTTCGGAT CCTCTTGATC TATGACCTTA CCCCCAACCC TGTGarCTCT 
43251 GAAACATGTG CIGIGICCAC TCAGGGTTAA ATQGATTAAG GGCGGTCCAA 
43301 GAIGIGLI 1 1 QCTAAACAGA TQCTTCAAGG CAGCAQGCTC GH-AAGAGTC 
43351 ATCACCACrc CCTAATCTCA AGTACCCAQG GACACAAACA CTQCGGAAGG 
43401 CCDGCAGGGTC CTCTGCCTAG GAAAACCAGA GACCI I IGI I CACIIGIIIA 
43451 TCreCTCACC TTCCCTCCAC TATTUTCCrc TGAOXTQCC AAATCCCCCT 
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43501 CreOGAGAAA CAC3GDVAGAA TGATCMTTA AAAAAAAAAA AAAAAAAACA 
43551 ACCCAAGACr GCATAAATGT CCATTCTCAA AACTTQGAAG AAGTACCACC 
43601 TPGATGAATA AQCTUrCTAG CmTATTCG CATTTAAGTA TTCTCCCArA 
43651 QOGAAGTCTA AAACrPCTAG GCmTACTT TtTAtAGGTA CTATATTCTC 
43701 CAAATAATCT CAGCACCTCA TQGTTQCrAA QGATCIGIGT C CI Ib l I I GG 
43751 TCAGATTATC TTTATCTCTIG'GCATAAGGCA CTTAACAATA TTCATTAAAG 
43801 GTTACAGAAT CM II IU.I I CATCrGOTA GCATTTCATA CGV G I I I GI I 
43851 TTCCACCAAA CTTTCAAATT TTCAI IGI II. CATTAATATT CrGCATACTC 
43901 ATCTAAACCA AGmCTATTA TTGTCCAATC TQCTCCFGAA ACCGTAQGA 
43951 ACTCrCTGAA GGAGmTAT TTATTTTTTG II I IIGII II IGI I I I IGI I 
44001 I IGI I 1 1 I 1 1 GAGACQGAGT CmGCrcrcr TGCCCAGGCT AGAGTCO«Jr 
44051 GCreCGATCr CGGCTCTCre CAAACTCGGC CTCCQOGGnrr CACjGCOVTTC 
44101 TCCTGCCTCA GCCAGGGGAG TAGCTGGGAC TACAGGCACC CAOCACFGGG 
44151 CCTOGCTAAT 1 1 1 1 M IblA TTTTTAlGTAG AGACQQQGTr TCACC3GTCTT 
44201 AGCCAQGATG OTCTCGATCr CCTGACCrnG TAATCGGCCC GCCTCGCCTC 
44251 CCAAAGrecr GQGATTACAG GCGTCAGCCA CrCTQCCCQG CCI 1 1 1 1 1 1 1 
44301 1 1 1 II 1 1 ICr TTATQQGCrr GTCnOACA CmCAGATTT GACTAAATTA 
44351 AATATGCATT AAATCAAGTC AQGAGTTCAC ATTCCCACTA GTAACAATQC 
44401 CTAAQCnAC ATAAAGCATT ATAAAATTUr TCGrGATTAG TCCCTTCTCA 
44451 GCTATGAGTA TAAGATAATA TTATACTAGTr AGTTCAGTTG CCTAGATAAA 
44501 TTCTACACTA TGTCAAGnT TATTTACATA ATrCTTACQG TAI I I I I lAA 
44551 GGTAGrnGAT AACAGrTGAG ACrAOVATTC TATOOZATT TTATTCATAG 
44601 TAAAATGAAG GAAQGGAQGG TTACTACCAt AGGAGAGCTC CrCCCOGTTG 
44651 CACrClTGCC TCTAAAAATT I I ICI GCOV^ AAOVATTTAG ATAATAGAAT 
44701 TGTAAAAATA TTATTATAGA Al IGI I IGTC TCAAACTATA GTAATGTAGA 
44751 ATAQGrPGAA QOGGrGATGA TTTCAAAOVV TACjCTCTCOV TTAGCTAAAT 
44801 TTTATATAGA ATCTATTQCA IGI I I lAAAt GATAAGTTCAG ATTTATAAAA 
44851 ATATmTAT AAACAGTAGG AAATGAGTTT AGGGGTATTC ACATACAGTT 
44901 TTAATTTTTA TTTACATATT TAAAACATAt CATQGTATAA ATATGATGTG 
44951 GATATAAATT TGAGATAAAG GAAGrATTCT TTAAGAATTG ATCAACTAAT 
45001 TTCTTAAAAG ATOTCATO^C OMbl IGbl 1 1. TCrAGGCFTA TGAAAAATQG 
45051 TTGCAATAAA AAAGATPGAG TATGATAAAA TQCTX5CCCTT TCAnTTAAC 
45101 CTAGACCAAG AGAAAACATA CrGTCAATCT ATGATGAATG AAAGAAACTT 
45151 GTAACTOTTC Gl I I IGIATA TTrcTAATTA CIGI I lATTT TCAI I I Cilia 
45201 TGAACTCATA CrcrACnTC TTCATTCreA GTAGACAACT TATAATCTAT 
45251 GTACrCAAAT TQGTTTAGrA TAAATTGrAG QGAATGAAGt TCATATTAAC 
45301 TCTAAAATAA OVTGAI IGI I CTCTAAAACA AAACGTCTTC TQQGATTATT 
45351 TTTAACTAAG GCGCATQGGG ATCI I I I 1 1 I CAI I I i lACA QOGAATTCAC 
45401 ATAGQQGATA GCTTAACTAT TCCAGATCAA TTCACGGAAG ACGAAAAAAA 
45451 ATCCQGACAA TQGraGAQQC AG C I 1 1 I G GC AGGAQQCATT GGTCGTXSCTC 
45501 TCrCTCGAAC AAGOVCTGCC CCTTTQGACC GtCTCAAAAT CATGATCOVG 
45551 GTGAGCTTTA TTATCGTCre TCCAGGTmG CCCTAAATAT TCTAAAAOVA 
45601 T GAGAAA TGT GGIGCi I IGA AAAAGAAGIT TTAAAATTTC TCAGTAATAA 
45651 TCTTTTATAC CCTAAAAAAT AAATCTATTT TGrnQCTCTT AACTCTAAAT 
45701 TCAOTCCATC TAAGfATQGC AGTCTACCAA ACCTTAAATT GTTAGTACAT 
45751 GTUTUTAATC AACi I I lAAT CTTPSGGATT CTATGACTAT TCAAACATTT 
45801 AATTCAAAAA ATATCrCTAG CTATTGTTGT AGGATTCTCC TGATTTATAG 
45851 TTTCCI ICI I TTTAATATAC TTTATCAAAA CTAAAGTATT TTTGAAATCr 
45901 AGACrCTTAG AGOVGCAATG TAATriTGAA AATTATTCTA AAQCTGAGGr 
45951 TAGCAGAAAA AGATGnOQCT TTATAGACTG ACtTTCCTAT TTACTAQCAG 
46001 TGTAGCATTG GGCTGGCCAG AGTX3GAAAGA GQGAATCGAA AA6AATTAAT 
46051 ATCTATTPQC TCACTCTCGr AACCCAGrTA ATCCTPGCAG CAGCCGAGrTC 
46101 AA GTAGG TAT TTTATCATTT TTCCAQQQQG AATCTGAQGC OCAGAGAATT 
46151 GACTTTTCCr TTACAACAAA TGAGAGGQGG AATGCAGTAT CTTTGCCTCC 
46201 AGrecrCCTG GTTCTCATGC TGCATGAAAC CTCreAGGTC TCATTTTCCr 
46251 TCATTCTQGG ATQGGGATAA GAATATCTAA TAAGAATQOT TTAAGAATCA 
46301 AGCAATATCA QGTATCTGAT AATGrrCTXSGT ACACTOGAAT AACCTATPOG' 
46351 AACATAGTAG IIGIilACAA AATAI 1 1 I lA AAACIIIGI I ATAGTTATQG 
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46401 TCAACACTTT TTATATrTCr CTCrAGATTT CTCTACAAAA AGATTCTSAC . 
46451 ACIGI I I lAA GCCAGCATTC OTCAGAATG TACCCAAATC TCAAAATTTA 
46501 TTTAGQQGCA AAGCTAATCC TTTAAAGAAA AAQGAGAQQG GAI IGGIGIG 
46551 IGIIIIICII TAQGAACAGT AGTAACTTCA CTTTTAGAGA ACTTGAATAA 
46601 GCATTTATTT TTTCCI I Ibl CCTATTtTAT TCTGAAGTTT ATTTATTTAA 
46651 AATAAAATCG ATTTtTCTCG AATrTAGrTT CTGCAAATTT GAGGAGTTTC 
46701 CAAAGTCAAC CTTCAQGITT GATACTTCTC TAGAAAGACT CACATAACTC 
46751 ACTCAAAGCT TATTACCXiCJ QGnATCGTT TATTAGQQQG AAAAGATQCG 
46801 GATGAAAATC AGTCAAGTAA AGAAGCACAT AGQQCA6AGC HUG I I G I C 

46851 crcrcccrcr ggagtctcca igici iagt tccrsqcact GTiATGnsGC 

46901 ACTAGGCATC GAATATTQCA GACCAACCAG GGAAGCTCAC CTCAGCCnT 
46951 GGTXJPGCAGA GTTCTTATTG GGGCLIGI 1 1 TCATACTGGC CACATQQCTX; 
47001 GCCTTCAGAA TTOVACXICXn" TCTGreAGre TGrnGTGTCFG IGIGIGIGIG 
47051 TGTUrcrcrc TCTTTACTOG TAGTCAOCCC TrTTATGTCA QCTCAAAGAA 
47101 TCAGAAGAAT AGGTCATTTG TTTAATTATT I I IGGIGIAT TGGACTTAAT 
47151 CMjI III lAT CTXJTAGGTCG TCATAAGGTA OVGrATTTTT AAGTGACTAC 
47201 CACATCrGTA GTATAAGCCA AGiAATTTAT CAGTACTCAC AGGATGGGTA 
47251 CATCTTCTAA TGAATTTATT GCCTAGAGAG GGCCTCAAAA TATGCCAAAG 
47301 AQGGTXXIAAT TTTTATTTTT QGTTTCAGGC TGTATGCATT CCAGTCTTCG 
47351 TAGCCCTCAT ATACACAATA TCCAAACCAT TTCAGACCCA TTTACAGTTC 
47401 AUCTCreTAC TALIICI IGA QGAGAQQGAG TAACATATTA CTTTAAATTA 
47451 TATCTAATAA TATACATACA TTAAATTATA TCTAATAATA TAATATTATT 
47501 ATTTGCAGrA TALI 1 1 1 1 lA TTTCCCTTTA ACTGAGOTG TTCATCTTTC 
47551 AAAGGGrGTT CGATTGCCTXS ATAGATAATT' TAGTtAATAT TATCTTATGA 
47601 AGGI IGI ICA TAATTTTAAT ACTLI ICI IG ITCTTCTCTCr CTGCnTCrC 
47651 ACACTGAAGA TACCAATTAT TCTTAGrmT AGAGTCAGAG AO«3GCCTCr 
47701 AAAATCATQG CAAtACTCCC TCTCATCArT ATATAtATTT TTCAAOTrT 
47751 CTATATTTTA I I I ICAAATA TAtLI ICI IG CAGTTAGAAA CGGTATTGAA 
47801 AAAGATTGTC TQGI IGIICT AGAAAAAGTA ATAGTAATAT GCCACCAGCA 
47851 TTTTATATCA TTLIGCI I I I Al I I I lAGGT TCACGGTTCA AAATCAGACA 
47901 AAATGAACAT ATTTCGrOGC nTCGAGAGA TQGTAAAAGA AQGAQGTATC 
47951 CQCTCGCnt QGAQQGGAAA TQGTACAAAC GTCATCAAAA TTCCTCCTGA 
48001 GACAGCTXJTT AAATTCTCGG CATATCAACA GGTAATTCTT ATCACCCXmG 
48051 GAATTTATTA ACAAAGAQGA GrTAGTAAAC GGATTXIAATA AATCTTAATG 
48101 TATAATGCIT TTQQGATTCT IGM I IAATA OOGATAATC TTTCACATAT 
48151 ACCCCATAAG GAGGATCACT TATAQGAGAT TAGACTAAAT AAAATCAGAG 
48201 ATTTCrCATG ACCAAOTTAT GGGATTCTTA ATTCATCATA TTATTTATAA . 
48251 AGII I II I I I. TTCrAAGTAG TTCTTAAAiSG AAGGGTAGAA TTTTAGTTTA. 
48301 TTCATTCTCA ATCCTX3AGCA GAAGO^GCAC ACTAACATAA GmTATCAA 
48351 AGTCTCACAA TCTAACCTO: QGAAQGAAAA CTATAAGTTC AA6TCCTTTC 
48401 TCTAATTTGA CGTTCCTCTA AAATTGAQCT GAGTTTQGAG TGACACCTCC 
48451 ATGAAQGCAG GGGGGreGCT TCTTCCCCAT GTACTCCAGC ACCTAGAGAG 
48501 AGCmOGCAT GTCATAAGTT TCAAGGGAGT GTTGAATGAG TCAATGAATG 
48551 AACAAATGCA TTTACCTCTG AATCACTTCT CTOTCGGCrT I IGI lAACTT 
48601 QGATTATTTC AGCTATTCCr TCAGCCTAAC TOVATCTAAA QQQGAAATAC 
48651 AGAGGTAAGT TTTAGAGrTT QQGTTCTaT TATGGTCATT AGCAGAACTG 
48701 TCTAGrrGAG CAGCCACAGA TTATGmTC CATTATTTAT TCCATCATTG . 
48751 TTTATCAAQG ACrGTAAOGG CGTGAAATT CAACTCCCCC CCCCATAGTT 
48801 I I IGIATTAT TCCATCTAGA TnTAGATTA TTCTOGAGAG IGI 1 1 IGI IC 
48851 TTCAQCAAOV GAATACTCTT GAGAAGATTA CXSAAGTCXyVG TOGTATCCTT 
48901 TTOTTCCCr AGGAAATAGA GAAGCAAAAA AAAAAAAAAA AAAAAATTAA 
48951 AGAAAA tCTA GTCTCCAGGA TTTTAATTAG AACCTATCCT TQQGAAQGCT 
49001 ATTTTCOTA TATGAAQGTT TGAAGATTCA AATCATGATT ATTAAGQGCT , 
49051 AAIGI I IGAG ATACCCTTAG GnATTCTGA CCACATACTT GGATTTTATG . 
49101 AtAQGAAAQC CACAGCCTAA AATAAATAAA TACTGAATGG AGTTATTTCA 
49151 GTATGCAAGA AGrmOGTAT nTTGAAAAA GTCCATOGGT ATPGCAAGCA j 
49201 AATATQCACA 1 1 1 IGLI I lA TGCGM I IGI CAGATTCITA CCnGGATAC 
49251 CACdVAOVQG OVTOCTCTGC TTOGTCCAC CCAAGCTCCf TGCTGAGAGC 
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49301 TGTTATAGT ATTCTCATTT CTCCACACTA ALI I ICI lAG AO^TCAAGAG 
49351 AAAGCTCTCr ACAG^GTCTC GTCTAGmT CTTATQQGCr CTGGACCrAT 

49401 GGrecrcnr Tcrcrccrcc tqctgaaggt ccattcatcc ctgqqgqctc 

49451 TGTAAAAQCX: AO.! ICCIGI GACAAQCATA TACTAAGCAT CTCAATCAAA 
49501 GCCAGTTCCr CCCCTXJTCCA GCCTCCCTCG AGKCTGAAT TGCAGAATAT 
49551 CCCAI I ! I IC ATTQGATCAT GGAAAACCCA TTCTTTTCCC AGTGGATTCr 
49601 AAATTACrrc GGQGTAAATA GGCTCTATAT ATTCTCAAAT TTCCOVGAGr 
49651 ATGTAACTAG GTCACmTA GATTCAGATA GATTTTCTTC CTTCAATAGC 
49701 TAGTACTTTA QGAAACTAAG AAAAAGATCT TTTCAACCTC GTATGnrAGCT 
49751 CPGTCAAACA CATCATCAGT ATQGGGTAAA CXTCTUTTCT CTCTOGGrnG 
49801 TCATTACCAT AGTAGTUTCA TTCTATCATT GACAGTCTAA TAGTCTCQOG 
49851 TAGIGI ICI I GTOCTTTCAG CreCCACTCT GTACreACFG CnTCCACTC 
49901 OAACATCmC CTCTTTATCT CAACACTCTA QGIOACCrG TOTACrcrGT 
49951 GTTTCAGCAT CTCTQCTTCC ATGACCCAQG AGTGCrrCCC ACKAATATG 
50001 GCCACCATQC ATGCTOVrCT TTCTCCTACJ CCCFCTCTCC TCACCCTQCT 
50051 CCAGCAACAC AGACAGACAC CCrTCGTCTT TCTATATCTC ATATQGTCQG 
50101 GAATGCCGT TAGTACTTAC TONGGAGHA GTTCCTCTCG GAAGCXTTCr 
50151 GrTCTAGTTT CCI 1 1 101 lA CAGCACTTTC ACATTCAATT CFSACGITCT 
50201 CTCTAOTAT CIGCI I IGlCi AGACTXJTCAG CTTCCrrAGG CAGTAGCTAC 
50251 TTCTATTCTT AGCACCTPGC CCAGTCCCAG GAAACCCTTA TTAAGTAAAT 
50301 GAAAAGACAG AACTGACAGA CTQGAATTAG AGCTCAAGCT TGCCTCAATC 
50351 TCAAGCOVtT AAGATCAAGG QGAGCCQQQC GTCGTXSGCTC ACQCCrCTAA 
50401 TCCCAGCACr TrAQGAGGTA C^l I IGLI IGA GCCCAGGAGT TCAAGACCAG 
50451 CCTGGGCAAC GTGGCAAAAC CCCATTtGTA CAAAAAATAT AAAAATTA6T 
50501 TQGAC3GTCGG GGTCTGTCCC TCTACTCAGG ATGCTGAGGT GGGAGGATCA 
50551 CTTGAGCTGG AGAGGCAGAG GrraZAGTCA GCTCQGATGA CACCATTCCA 
50601 ATCTAGCCTG QGTCATAGAA TGAGACCTTC TCTCAAAAAA AAAATAAATA 
50651 AATAAATAAA GQQGAAGATA AGGATTQGAA ACAGAAGGAG OVGCATGTCG 
50701 ACAGAAATGT AQGCAG\AGA AGGCATOVCT CACTGAAGAG ACTGAAAGTC 
50751 GTTCACrcre CCrCAAGACT GGTGGAGTCT GTTTCCGGAA AGATAATGAT 
50801 GAAAGAGCFG GACAGATAAA CfiCGGOQCAA ATCTAATAGG AGTCrOGATT 
50851 TTATTCTGAA TATOGTAQOG GCTATTCTAG CATCTTATAT AQQGAAGTCA 
50901 AATGAGTACA TTCACATTTA AGGAATATCA ACCTGAAAAA AGAGTCGAGA 
50951 CATTGTTQGG GGAGAGTGAG GTAGACTAGA QGCAGGGAGA ATATTTAAAT 
51001 AATTGAGGTA AGAAATGATG AAOVCCAGTA TAAGGTCATG TCtTTAAGGA 
51051 ATGGAGAAGG GAATGAACPG AGAAATATTT TCGAAGTAGA ATCAACAGAA 
51101 CrCACTGACr GACTGGATAT GGAGGTGAGA AAGAGAAGAG TCAAGAATGA 
51151 TATTCTAATT TCTAAOTGA GTGACTGCAT TCAAAGAGAA TACAATATOA 
51201 QGTTCCATTT TGTCCATGCr GAGITTGAGA TCTCraOGAC ATCTACAGGG 
51251 AGCrenCCAG TAAGCAATTC GGTATATCAG CTAGCCATTA AGAGAGAGAT 
51301 CTTTGATAGA GAGGrTCTTG CreAGTPGAG CCATTGGAAT GGGCAGGATC 
51351 ACrCAAGAAG AGGTATAAA TGAGAAGAAT TCTAGGAATA AGTCCAAAGG 
51401 GAGAAGTAAA AGAAGAAACT TGCAAAGGAC ACTGAGAAGA AATAQCTCGA 
51451 GGGATGGGAG AAAATCCAGA GAGAGGGATG GCATAGGAGT CAGTQGAAGG 
51501 AA AOGGT TTC ATQQQQGTCA GTACTACraG GTAGTGAATA TAATAAGAAT 
51551 ATCI I I lAQG ATTfCTCAAC CCAGAGATAG GTAAGCTTAG TATAAATGCT 
51601 TCTUreAAGT AATGAAATCA GAAACCATGC TGAAATGAGC TTAAAGTCAA 
51651 TQGGAQGTX5A AGAAACTTCG ACAGTAGAGA CAOVI I I I lA QGGAGTTnGA 
51701 CAGTGAAGAG AAGGAAACTA GAAGAGGGAG AGGGTWAG ATAAGAAAGA 
51751 TCmOQGFQG AQQQGATTTC I IIIH I GII 1 1 1 1 I G I 1 1 1 1 1 1 I CIibI 1 1 
51801 GTATbl I ICjI TTCTTTTreA GATOGAGTCT CACTTTATCA CCCAGGCTQG 
51851 AGTAAAGTOG TQCfiATCTCA TCrCACFQCA ACCrCTQCCT CCTAGGTrOA 
51901 AGreATtCTT CreCCTCAAC CrCCTCAGTA GnNNNNNNN NNhJNNNNhB^ 
51951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
52001 NNNNNNNNNN NNNNNNNNNN NNNNNlXXiCT CAGCCTCCCG AAATGCPSQG 
52051 ATTGCAGGAG TGAGCCCCCC GTGCCTXjGCC TG6AGQ6AQG ATTTPGATTT 
52101 GACTTTAATC TQCCTCTTGC TGAAGGAAGC ATGTCAATAC AAATAAAGAA 
52151 GTTGAAAACA TAGGTAAGAG AGGnGATTA AQXGGTAGG IblllCAAGG 
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52201 GAGrrrcrcr gtagggaaag ggagtcqgag atqgaaagqg gctx3C5qqgag 

52251 ACAGGTTCTA TCCAGAGACT GnTTAAAAQGA TTAGTCTTTG ATTACAAGAA . 
52301 GAACTCTTCr TATAOCTCTT TQQGAAGAAA AAATATGTCA CTAGCTATQG 
52351 ATAATTTTOC AGGAQGTOQG O^GAATAOCA AGATATTCTS OCTQOTQQCC 
.52401 TCTCTACTCT: TCOTCAGCT CCTCAGAAAG GATCPGATCr GAGAATGAGG . 
52451 GAGGAAGTGG TATTOGAAGC TCGAQGAGAA TQGAGAAGAT CAAAATQGTT 
52501 AGTCT AACAA A7GQGA6AGA ACTCAGATAG AO\AAAGGAT TTOVGGGTGG . 
52551 TTTTGAQOGC TCAGrTAAGT CTCCTTTAQG . AAQGITCAGT TCTCTAGCCT 
52601 TQGONAGnA aTAAAGTCT CTUTCACTAT TACTKATXT CTAAGATQGG 
52651 GACTAAGCTT GGTCACATAG TTTTACATAC CAGGCACAGT GCCTGACTTT . 

52701 TTOGcrcrcr cctgaagtct tccli i igia tatcgtatgt ttgogggaat 

52751 AGGAGCCrCA AGCAOTATC CTTTAAATAT TTATCCTCCA TCAGTOACTA 

52801 AAajnTAcr CTCTAcmr gataggtoct gtgggqgtcc agggtataaa 

52851 AGGrACCTTC AAAGTTACTG TTAAAGPQO\ GGAAGGTTTT TAAGCAAATT 

52901 ATGTTTAATG ATnTGACAA TCTXJACATCC AGGAAAATTA ATAGGGGCTA . 

52951 TQO«5AAGAG GAGTTTTATG TAACACTCTG tAGnUVQGA AACAGAQCCC 

53001 TTCGAAGQVG TGATCTtTXT QQQGAQGAAT GTCTOGTATT TQQGAATCTC 

53051 ATGAAATCAT AATATACTTA AI I I I lATCA TGAGCAGCAA AACAGAGATT 
.53101 TGGTAGGAGA AAGTCATCGT ATGTTGTIGC ATTGGGCACr.TTAGATCCCA . 

53151 GGGAACAGAA ACPQGCrQGC ACAQGAATCG GCATCACTGT GGQGATGGAT . 

53201 CATGTAGQQG AAGGATCCCT QGAGAAGTCG AQGAGGTGAG ACITCCCCCr . 
.53251 tCCCrrCTCC ATGCATGAGT CCACI ICICI CI G I I G ACTT TCCCj L I i g i c 

53301 ecrCTGGTGA CAGOAGCrcC ITACCTCTGG AGACCCCCTC AGATTTCTGA . 

53351 GAGAAGGAAT CTGGCTTQCC TQGGTAATTC CGOGGTCrA I G I HG GGCA . 
"53401 GAAIGICI lA GCAAGTTCTG TAAAGATAGT GTATTCATAT ATTAATAATA 

53451 ATAATAACAT CTACTGAACA TTTCCTAGGr GTTCAGACXn: GGAGTAACOG 

53501 TCTTACAAGr ATTAI M 1 1 1 TGrTAATCGt TCCATAACEC TCTCAGGTAA . 

53551 GTAGrcnAT CACAGACAAG GAAACCAOVA TCTGGACCTG tTOVTGAACT . 

53601 TGCrCGAGGC CAGGTiQGCrC TQGAGTTCCA GCTCAGGTCT GCCTGACTCT 

53651 CAATCCCATC ATATTAATAT ACTTQGCCAGT CACTATTTre GGTGrATfQG 

53701 GGTCATATTT ATACCCTPOG TCCAGTTAGC TATGTTQGGT CAOTTAGrA . 

537,51 CTGATAQCCA GQGAGATQCT GGGCTPGATA GGTTAGTATA ATTCTATCTA . 

53801 TTACOACAA AAACIGI I I I, TATAAATTUr I I IGI IAACA I I IGI I IGIC . 

53851 ACCTATTTAT TCATTTTATT TGCACTX5GTG AAAATAAACT CATCmtAA . 

53901 AAAC rGTOGG GAAAATATCC AAACATPGTC AAAACTTGAT TAACCTTGTA . 

53951 I I I ICIGIAC ACCTCGGGAG GGATGCTXJTT ATQLIGI I IC AGCAAAQGAG 

54001 CAAcrroGrc CAATCTGQGA GACATCrGre l l l igiggaa atctgaotg 

54051 AAAACCACrc TCCAGTCACT GCGPGTATTA GCATTTAGGC CnGCTCTTC . 
54101 TGCTATCTAT TATTAATCTA GrGTATACAT TTGGAGACAC ATOVTCACAT . 
54151 TTCTGAATIT ATrcATnCT AQGAGCTCAT TTCrATTCTA QGATTCTCTA 
54201 GrnQGOTOG GCTGCCATAA AATACCACAG TGIGIGIGGA ATCAACAACG 
54251 GAAATTTATt TCTAACAGTT TCAGAGGGQG GAAAGCCTAA GATCAAGQGC 
54301 CAAGCCAGTT TGATTTCTAG TGAGGGTTO CTTCTCAGCr TGTAGACAGC , 
54351 TQGTATCTGC TCAOVTGCTC 1 1 i ICI IGGT GCACATSTGA AGQGQGAGAG . 
54401 AGAGAGTCQG CTCTOQGTC TCreCTCTTA CAAGAACACT GATCCTCTCA 
54451 TX3AGGGCTCC ATCCTCATGA CCTCATAACC CTAATTACCT CCAGAAGCCT 
54501 CATCrCCTAA TACCATCACA TGGGAGGTTA CAQCITCAAC ATATGAATTT 
54551 GGTGGGGGTG CAGCTCAGTC CACAGCAQGT AGTAATCTCC ATTTTAAAAC 
54601 ilGIIIATAC AGTACAAGAA GnAOTACT GAAGAAQGAC AAAAAATAQG 
54651 AACATT TGAG AGATTTATTT CTGGTTCCAT QGCTOGAGCA ACTGOVCAGA 
54701 CmTATATA TCCAATQGAG GTGAGTACCA TTCTCAAGTC TGACTXHOTG 
54751 ATQGTCTTQG Ibl IGbl Ibl CTATPGCrCT CTAACAAGrT ATCCCAAAAT 
54801 TAAO«mTA AAACAAGGAT TTATCATCGC ACA fa l I I C I C TQQGTCAGGA 
54851 ATCPQGAAGC AGClTAGCre QGTGCCrCTG GCTCAGGGTr TTTCACAGCC 
54901 CACAGTCAAG ATOGTAGTCA GAGCnGGAA TCAGCTGGAG GCGGATTCCA 
54951 AGCrCACrCA IGI IGCIGCC AGGCCTCACT QGCTATTCGC 7GGAAAGATC 
55001 AGTTCCnAT CACGTGAGCC TTTCTCTAQG CTGGCTGAGT ATCCTCAAAA . 
55051 CACAGTAGCr GGCrTCGCTA GAGTCAGRiG TCCAACAGAG AGAGAGAGAG 
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55101 AGAGTOCCTA AGATCAAAGC TQOTATCrTT TQCCTXnTCT GCTCTATTCC 
55151 ATTWCACA CAGACOVACC CTOGTAGAGT GTAGGAGGQG CTCCTATAAT 
55201 QGreTTAATA ACOiGAGACA AATATCACFG GQQGRACTT TAGAGGCTQG 
55251 CreCjOVOTT AGAQQCreGC TCCCATTOCr GTCCAAAGAG 1 1 IC I GIA CC 
55301 ATAAATTTAA TAATGGAATC TONGGATTTC ATTATATQGT GATTATCCTA 
55351 ATTAGACATC CTITCATTAG TGCATAGGIT GGCAAAACAC AGACCTAGGG 
55401 ACrcnrCAT ACAGCCOTG ACCTAAGAAT GCCTTTTACA rmTAAAAA 
55451 GTX^GGCAACA CAGGAAAAAG TGAGAAAGAT CTAAAATGGA CACCCTAAGA 
55501 TCACAATTAA AAGAACTAGA GAAGCAAGAG CAAACAAATT CAAAAGATAG 
55551 CGGAAGACAA GAAGTAGCTA AQGTCAGAGC AGAACTGAAG GAGATAGAGA 
55601 CAC3GAAAAAC CCTTCCAAAA ATCATTGAAT CGVGGAGCTG II II lATGAA 
55651 AAGTTTAACA AAATAGACAA CTAGCOVGAA TAATAAAGAA GAAACOVGAG 
55701 GAGAATCAAA TAGCCCOVAT AAAAAATGAT AAAGQQGATA TCACGACOV\ 
55751 TCCCACAGAA ATACAAACTA CCATCAGQGA ATACTATAAA CACCTCTATG 
55801 CAAATAAACr AGAAAATCTA GAAGAAATCG ATAAATTCCT QGACACATAC 
55851 ACGCTCCCAA GACTAAATCA GGAAGAAGCT GAATCCCTCT ATAGACO^AT 
55901 AACATCTTG" GAAATTGAGG OVGrAATTAA TAGCCTACOK ACOVAAAAAA 
55951 ACCCAQGACC AGACAGATTC ATAGCCGAAT TCTACCAGAG GTACAAAGAG 

56001 gagctcatgc CATTccrrcr gaaattattc aaacaataga aaaagagaga 

56051 TTCCrCCCTA ACTCATTTTA TGAGQGCAGC ATGVTTCTGA .TACTAAAACC 
56101 TQGCAGAGAC AO^ACOVAAA TAGAAAATTT CAGQCCAATA TGCCPGATGA 
56151 AOVrOWnGT GAAAATCCrc AATAAAATAC TCGCAAACTC>AATCCAQO«3 
56201 GACATCCAAA AGnTATCC^ CCATGATCAA GTTCGCTTCA TCCCTGQGAT 
56251 GCAAGGCrcr TCAACATATG CAAATCAATA TAACGGAATT CATCAATAAA 
56301 CAGAACCAGT GACAAAAACC GCATGATTAT CTCAATAGAT GCAGAAAAGG 
56351 CCTtC3GATAA AATTCAACAC CACTTCATGT TAAAAACTCT GACTAAACTA 
56401 GnATTCATG GAATCTATAA OWWTAATA AGAGCTCTTT ATGACAAACC 
56451 CACAGCCAAT ATCATACTGA ATQGQCAAAA GCTGGAAGCA TrCCCmnGA 
56501 AAACCQQCAC AAGACAAGGA TCrCCTCTGT CAGCACTCCT ATTCAAOJtA 
56551 GTATTQGAAG TTCTGGCCAA GGCAATCAGG CAQGAGAAAG AAAtAAAGCG 
56601 TATTCAGATA QGAAAAGAQG AAGTCAAATT GrCTCrGTTT GCAGTrGACA 
56651 TGATTCTATA TTTAGAAAAC CTCCTTCTCT CAGCCCCAAA TCTCCTTAAG 
56701 CTGATAAGCA AGTTAAAGOX AAGTCTCAQG GTACAAAATC AATCTOCAAA 
56751 AATCACTAQC ATTCCTATTA ACCAATAATA CACA(\ACAGA GAGCOWVfC 
56801 ACGAGTCAAC TOCCAtCCAC AATTCCTACA AAGAGAATAA AATACCTCQG 
56851 AATACAACTT ACAAGQGATG TGAAGGACCT GTTCAAGGAG AACTACAAAC 
56901 CACrCCrCAA GGAAATAAGA GAQGAOVCAA ACAAATQGAA AAACATTTCA 
56951 TGCTCATQGA TAQGAAGAAT CAATATCATA TCATAQGAAG AATCAGPQGC 
57001 CATACrcCCC AAAGTAATTT ATAGATTO^ TGATATCCCC ATCAAQCTAA 
57051 CATPGAATIT CTTCACAGAA ATAGAAAAAA- CTACOTAAA TTTCArATGA 
57101 AACTAAAAAA GAGCCTCTAT AGCCAAGACA ATCCTAAGCA AAATGAACGA 
57151 AGCTGGAGGC ATCACGCTAC CTX5AGTTCAA ACATACTACA AQGCTACAGT 
57201 AACCAAAACA GCATCGTACT GGTACCAAAC AGATATATAG ACCAATGGAA 
57251 CAGAACAGAG GCCTCAGAAA TAACACCACA CXJrCTACAAC CATCTGATCT 
57301 TPGACAAAAA OVAQCAATQG QGAAAGGATT CCTTATTTAA TGrATOGTCT 
57351 TGGGAAAACr GGCTAGCCAT ATQCAGAAAA CTGAAACTOG ACCCCTTCCr 
57401 TACACCITAT AAAAAAAAAA TTAACTCAAG ATAGATTAAA GrCTTAAAOV 
57451 TAGAGTTAAA CTATAAAATC CCTAGAAAAA AACCGAQGCA ATACCATTCA 
57501 QGAOVCAQQC ATQGAOWVG AOTCATGAC TCAATCACAA AAGCAAtQGC 
57551 AACAAAAGCC AAAATTCACA AATCQGATCT AATTAAACTA AAGATCTTCT 
57601 GCACAGCAAA AGAAACTATC ATCAGAGTCA ACC3GGCAACC TACAGAATQG 
57651 GAGAAAAATT TTCCAATCTA TCCATCTGAC AAAGGGCTAA TATCCAGAAT 
57701 CTATAAQGAA OTAAGOWV TTTACAAQVA AAAAAAACCC ACCAAAAAGT 
57751 GGGTSAOGGA TATGAACAGA CACTtCTCAT AAGAAGACAT TTATGCAGCG 
57801 AAO^AACGTC AGAAAAQGCT CATCATCCCT GGTTCTTAGA GAAATCCAAA 
57851 TO\AAAGCCC AATQGCATAC CATCTOVGGC CAGTTAGTTA AAAAGTCAQG 
57901 AAACAACAGA TCCTGGCAAA TATGTOGAGA AATAGGAATG OTTTACAa: 
57951 GrnGGTGGGA GIXTTAAATtA GTTC^AGCAT TCTGGAAGAC AGtUTOQCAA 
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58001 TTCCrCAAQG ATCTAGAAOC AGAAATACC3G TrTGMiCCAG OWTCCCATT 
58051 GCreCTTATA TACTCAAAGG ATTATAGATT TTTCrACTAT AAAGACACAT 
58101 GCAO^CGTAT ATTTAT7GCA GCACTCTTCA CAATAGCAAA GACTTCGAAC 
58151 OVACrCAAAT GCCOVTCAGT GATAGACTAG ATAAACAAAA TATQGCACAT 
58201 ATACACCATG GAATACTATG CAGCCATAAA CAAQGATGAG TTCATGTCCT 
58251 TTGtAQGGAC ATQGATGAAG CTQGAAGCCA TCATTCTCAG CAACCTAACA 
58301 CAGGAACAGA AAACCAAAGV CCACATXHTC TOVCTCATAA CTTGGAGTTG 
58351 AACAATGAGA ATACATCGAC ACAGGGAGGG GAACATCACA CACTOGGGCC 
58401 IIIIIGGGGA TGAQQQQCrA QGGGAGGAAT AGCATTAGAA GAAATAGCTA 
58451 ATGTAQGrnGA CAQGrPGATG GGTGCAGCAA ACCACCATQG CACGTXJTATA 
58501 CCrATGTAAC AAACCTQCAC GTTCTGCACA TGTATCCCAG AAOTAAAGT 
58551 ACAATTTTTA AAAAGTAQGC AAAAACAAAA GAAAAGAAAA GTAATATACA 
58601 ACC3GAGACCr AATATTTTAG GOTQCAAGG ACAGATATTT TACTATTTAG 
58651 TCTTTACAQG AAAAGTTTrc CAACTACTGC TTTATAGCAA AAATAATATT 
58701 GTAGATCTGG AATTTATTGA TATAGCA6AG GGGI I I I lAG TAACTGATCA 
58751 CTTAAGCAAG ATAAATACAA TTTTCACOGA TATCrSGTAT GCATGCTAAT 
58801 AG^GLI II 1 1 TTAAGCATCr TAATATCATT GITTATATTA CTCCACACAC 
58851 CTCrOWVAA AACTTAATAC CCrATTTTTC CTCTCATATC CTCCOVTATC 
58901 AGTTAATAGT ATCACCTTGC CAACTCCCCA CTQCGCOVTC CIGIGI ICCA 
58951 AGCTAGAAGT ATTQQQGTrA TCOTTATAC TACCATTTCC GTCACCTTCC 
59001 AGATQCAGGT GGTCACCAGT CAL\ 1 1 IGI I AAGAOVTOVA TAGATTATCT 
59051 TGCTTOIATT tCLI KjGICA CTTCCrrCAt CAGATOna: ITCCAGTAAA 

59101 asGGTcrcrc TOGCrmSGT cttagcgccc caatagaggt aataomgaa 

59151 AGAGAATGTA TCAACAAATT GTACAGTCTT TTGAGrGAO>k ATATGTlGCrA 
59201 GGTAI I IGI I COVTUTAAAA TTACTTCATT TGAATCCCAT GA7WAGAG 
59251 TTAATATGAA CAATCATATT I IGI 1 1 II 1 1 TTATATCO^ GTTATGAAAA 
59301 CCAQGCTCGC TCTAQGCAAA ACTOGGOVGr AGTCTOGAAT AtATGATTCt 
59351 GCCAAGAAGA TTTTGAAACA TGAAGGOTG GGAGLI I I I I ACAAAGGCTA 
59401 TGITCCCAAT TTATtAQGTA TCATACCTTA TGCAGGCATA GATCTTGCTG 
59451 TGTATGAGGT GAGI I IGIAG AAATCTTTTG AATPOGAAAA TGCAGTTAGA 
59501 TCrTCTTAGA ATPOGACrTT ATATGAAGAA GTAGATATAT ACGAGAAAAC 
59551 AGTCTUTCAC CAGAAGTAAA TTCAAGCATG TGtTATTnGA ACnTCAAGT 
59601 AACrreAGTG TGAATATGCA TGOGGTCACT TmSTATTAG Al I I ICIIGG 
59651 GAATPGCnr TGTTAATGAA GAGTAGACTC AAAGrTAGGT ATAGI IGI IC 
59701 ACOTAAAAG GTCnTCTAG AGAI 1 1 I I IC CI I IGI 1 I IG GATTTGCAAA 
59751 AATCreACAT TAAGCOWGT GACTAATGrC ACTAAGATGA GTAATACAGT 
59801 rrCATTCCTT GTACQGAAGA ATACAAATCT TGGATCAACC CTGCAATCTA 
59851 AATCATITAA TAATTTATGA ATCTCAONAA CAATTATPGA GCACACACTA 
59901 TACAAACCAC TAGGITAGAC ACTX3GATCTG GQGATTCAAA QGACTCAATG 
59951 TUTGCCtTCA AGAAACTGAA GGTCrOGnnQG GGGAGACAAA GGACTAAAAC 
60001 TCAGCGTGGT TATCPGTCCT GGGACAGACA TGAGCCAQQG TGCATGTTAG 
60051 GATGAGACCr AAGCTACAGC GTAGAQGAAG AGTOGAATGT GTAATGAAAA 
60101 GAAGAGTCGA Al 1 1 II 1 1 I I TAAAGAGCTT TATTGAGATT TAGTTCATAT 
,60151 TCCTTACATT TCACTCATTT GAAGTXJTACA AGCAAATQGT nTPOGCTTC 
60201 TTACATAATT TTTAAAAATT ATTATAAAAT ATAAAATTTG CCATTTTACT 
60251 AATtTTAAGT GTACAATTCA GTQGCATTAA TTACATTCAC AATATTCPGC 
60301 AACGATCAAC ACTATTTCCA AATCCmTC CTCACTCCAA ACAGAAACAC 

60351 crrAACcnr aagcaataac ttcctaccct ccgtaactca AACcrrroGT 

60401 AACCrOAAT CIGCI IICTA TGrCTAGQVA TTTACCCATT CAAGATATCT 
60451 TATAAGTAGA ATCATAOVJT Al I I I ICI 1 1 I IGIGICIGA TTtAtTACTX: 
60501 TTAGCATAAT GTCTCTAAGG I I IGI lOkTG TTCTAGGATG TATCAGAACT 
60551 TCAI I ICI 1 1 TOXTCGCTGA OTAATATTCC GTTATGTCTA TATACCACAT 
60601 1 1 IGI I lAGT CCTTOVTCTG TTGAAGAGG\ TTTQGATTAT TTCrACmT 
60651 CCAACATTCT GAATAATGCT GCAGTGAACA TPOGOVTCre CGTAtCrCTT 
60701 CGAGTCTATC CCTTCAATrc CTTnOGGTAT ATATCTCAGA ATGGAATTQC 
60751 TGAGCCATAT GGTCATTCTG TGTTTAGCTr TTAGGAACTA TGAGACTCHTT 
60801 7TCCATAGTG GCTGCACTTA CATTCTCACC AGCAACATAG AAAQGTTCCA 
60851 GIIIIIGCAC GrcOTTATTA ACACTTAATT TCCATnTAA AAAAGCTTAT 
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60901 TTTTATTATC GCGGTCCTCT TAQGKJTCAG GTOGTATCCn" TCAQGACTTT 

60951 ACTTcrrcre crcAbi i ii i taaaaaattxs tgattaaaaa cacataacat 

61001 AAACmTATS ATTTTAACCA TTTTTAAATA TATAGTACAG TAAOTGTrTAA 

61051 crcrrrcroG TmnTCroc aacagatctc TAGAAcmr ToscrrcrcA 

61101 AAACTTAAAC TCTATAGTCA TTAAACAACA GCTCCCAATT TCCCOTCAC 
61151 CCCAQCGCTC TCTAACCTAC TTTCTCGrTT TAlXyVGnTC ACTAOVTTAA 
61201 ATACCrreTA TAAGTCAAAT CATGTGGTAT TTCTCnTCC GrGACTOQCT 
61251 TATTTCATCT AACATAGITT CXTCATGATT CATCCATATG ATAGCATACA 
61301 ACAQGACTTT 1 1 IGI 1 1 1 lA AGGCTCAATA ATAAI I IGM 'GQCTATATAT 
61351 ATCACATTTT CTTTATTCAT CIGI IGAIGG ACATTTOGAT TCTTTCTACA 
61401 TCTTGACTAT TUIGAATAGT GCTXKiAGTCA ACATQGnXJT GCAAATATCT 
61451 OTCAAGATA CKJITTTCAG TTCTTTTTCA CATATACTCA GAAGTCGAAT 
61501 TrClGGGTCA AATQGTAATT CTAI 1 1 1 lAA GmTPGAQG AACCTCCATG 
61551 TCAI 1 1 ILCA TAGTAACTAG ACCI 1 1 1 Ibl TmTAACAT TTCTAtCAAT 
61601 GTACACCAAG ATTCXiAATTT CTCCMUTCC TCCCCAACAC OMTAAGTGG 
61651 GGraGTTGGTC TACTACTATT GCTCTCTTCC TCTTTATTOC TCCCTTOVGT 
-'61701 TCTCTAAGnS I I l(iLI iCAT ATATTTAGGA GCITAATATT AGGTCCATAT 
61751 GAAGTTATAA 1 1 ICI ICCTC GTAAAGTCAC CCATTTATCA TTATGTAATG 
61801 TCCATCrmS TCTLI IGIGA CAGI I l(,IGI CTTAAAATCr ATTTTCTGre 
61851 ATGTAATTAT GGCCACCCCT TTTCTCTTTG GGTTCCCGTT TTTATGGAAT 

61901 ATcrmrcc ATccnrcAC nTCAGOTA TGrcrcrccn: tagatctaaa 

61951 GTCAGTCTCA TAGATAAQGT ATAGTTCATT CTCTATCrrei: TATTCAGTCA 
62001 GGAATTTATA TCTTTTAGTT AQQQGATrTA ATCCATTTAC ATTTAAAGCA 
62051 GTTACreATA GGGAAGGACT TACrGTTGTC ATTTCGGTAG CTACCI I I I I 
62101 ATCTTTGTCC TGrGGCmT LIGI I I I ICC CrTCCTCrcr TCCTGGCTTC 
62151 TTLIGIGI 1 1 TCTTGATTTT 1 1 I 1 1 1 I 1 1 1 GTAGTGATAT Gl ICIGATTC 
62201 GCTTCrCATT TCCCM IGIG TQCATTCTAT AGATQCTATT 1 1 IGIGGI lA 
62251 CCATTGCAAC TACATAAAGC ATACTAAAGT TATAGCAACT TATTTTAAGC 
62301 TGTTTACAAC TrAACTTCAG TOGTATATAA AACrCTATTT CTTTACATAT 

62351 trcACcrccr ccccacaaac TTTArGicn ttgatattct atatccttaa 

62401 OV^AGATTTA^TAGTTAC^rT TTATQCmT CTTCTTTAAA TrCtXJTTTAA 
62451 AIMIGIIII TCAAATTTAG ATTTPCAAGr TATTTAtATA CCTTCATTAC 
62501 AATACTATAG GATTTTATAA TATTCTAAAT ATTGACCTTT ACCATAGAGT 
62551 TTCATATTTT GIGGI I I IGL GTreCTATTT ATOVTCCITT TGTTTCTCCr 
62601 TTTAQCCTTT CrrcTAQOGC CXSGfCTAGTC GTCATAAGCT GTATCAQGT 
62651 I IGI I IGtCA GOGACAGTCr TAATTTCTCC IMI IIGAAG GGCAGI 1 1 IG 
62701 CCCATACAGT Al I I I IGI I I GGOVG I 1 1 1 I TTAAGTITCA AAACATAGAA 
62751 TATAACATTC CATTTCCTTC TAACCTCO\A GATTTCCATT GAGAAATGCA 
62801 CTCAATCGAT 1 1 I 1 1 AATCC ATTGAGATAA 1 1 1 M lAATC GTCTAGGATT 
62851 TAAAATTTTT AGTCTTACAG GATTAAAAAA TTAAAAAGTt AAACTTGTTA 
62901 TATAACATAT TAACATGTAT TTTATACrfA AAGTATCTTA TGrTTTAAAAA 
62951 GTTCATTATC ATATATAtTT TATACAGITT CTCCTAATTA TPGCCTTCTA, 
63001 ATGAAAtACA GQGACCTAGA GTAACAGQGA TAAAGTATQG CCI II IGATC 
63051 AGCAGGCCTG Gl ICIGAGTC CTTCITAAAA AAACTCTGGG CCrGGTCTOG 
63101 TQQCnCATGC CrATAATGTC AGCACTTTGG GAGQCCGAQG CQQQGQGATC 
63151 ACCTXSAQGTC AGGAGTTTGA GATC^GCCTT GCCAGCATQG TGAAACCCTC 
63201 TCrCTACTAA CAGTACAAAG ATTAGCTGGG CGn5GTX5GTG GGTGCCTCTA 
63251 ATCGAAGCTA CTCAGGAQGC TGAGGCAGAA GAATOCTTTC AACCTCGGAG 
63301 GO\GAGATTC GQCCACTX3CA CTACAGCCIG GGTGACAAGA GGGAGACTCC 
63351 ATCrcVWVA AACAAAONAA AACTdOQCTC AGATGAATTT tTCrCATTTC 
63401 TAAAATCAGA ATAATAGATT TATGTAAGAG 1 1 ILIGIAAG GCTCAAATCA 
63451 AATATATGTA AOGTCrAAAA TCAGATACAA TTAGTAGAAT TATATTATTT 
63501 TATTAATACr CACCATAAGA GGIGI ICI 1 1 AGATCCTGCA G(Jbl I IGLIG 
63551 OQCAGTTCAC Gl I IGI I lAG AAGAATCTCA GTAACCQGTG CAAACCTCAT 
63601 GTCTTCOGCA CCCCCAGPOG CCTGCGACCT CTGCACAGAG TCACCGCCTC 

63651 crecAGrecc tqligli ict gcaaatgogt gggctcatcc tgcagaaacg 

63701 GGGCTTCKA TGAQGmSAG AATAGCnGTC AAAATCTTTA CGTrcAAGTT 
63751 GTAGAGmOG TTAATTATTT T L I I C I I lA T TTOCTCGCA GCTCTTGAAG 
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63801 TCCrATTQQC TQGATAATTT TGCAAAAGAT TCTCTAAACC CTCGAGTCAT 
63851 GGTCTTGCre GGATGGQGTC COTATCO^G CACCTGTXjCT CAGCTGGCCA 
63901 GCTACCCAIT GGLI I lUild fiGfiACTOXA TQCAQGCTCA AQGrGAATTT 
63951 TPGATTACAG MCCACAOOG ATAAAAGTCC TCC ACOm tA AlblGLI 1 1 1 
64001 AGAACTCCAA GTTCrACTAA GATQCAGACT GTAGnTTAA GACAGTATTr 
64051 CrCAACGTT TTTTCATTAt TtKICTCCTTA AGGAATCTTT TCAGAAATTC 
64101 TTnTCTAAA TQCTCCCrCG TCATGAAATT TTAATGCXiAC AGAAGCATTC 
64151 OVTATCTACr CTATCCATAC ATATCCCTTA TAGATAAACA GAGTACTATT 
64201 TTTTTTGACr GrnGTTACATG OVCGmTAA GATTATAAGC nTAGTATCT 
64251 GATQGATTTC GGTrCAGATC CTTGCCrCAG ALI ICI IGGG Gl I 1 1 lAATG 
64301 QGAATCAAAA TTCTACAGTC TTCTAAGAAT TACCAACAAT ATAAATAAAG 
64351 OOXTTQQCJr TTCTTAAATT TTTQGrAAAT GdlGbl IGGA ATOM i 1 1 M 

64401 AGrcrnocxjr agaccctaca agttttgagc tctcattxict cctcactgtc 

64451 ACACrercrc CATTCTTQGC TTTGATTACA CTCT ACCAT C CreGTTGTTC 
64501 TQCCAGCCOV TnGATAACTT TrACCATTTG CRSGCmTA TTGCTATCCC 
64551 CACrCTATTA. AAGTATQCAT TCAAATQCCT TTCnTTCTC TTTGATQCTT 
64601 TCCrTCGTCA GTCTTATCCA I IGI 1 1 ILI I AAGTAGTACA CCTPOGGCAT 
64651 CTACAGCrcr ATTCCCAACC TCCOTCOVA GK5CCAGCCA CAGCAACCCC 
64701 AGCCAAGCAG TCAGTAACTA ATTCGCAAAT ACTCCCTCAG CCATTGTCCC 
64751 ATTCTAGACA CTGCCAGATG CTAGGGGrrAG AGCAGTCAAC AAGTCAGGTG 
64801 TGGCCCGGGC AGTGTAGAGr AGAGAAGACG TTATGTCCAG CAAGTAAACA 
i54851 ACLIWal lAA ACCAACrCXT CTTTTCTTAG QQGAGCACAG AGG^AQGAGC 
64901 TATAACCTAA CrPOGGCGCT GCAGAATGCT GTCAGTGAAG CTCAGACrOG 
64951 AAAGATGAGT GGGAGTTAGC TQGGCACAGG CCAGTGGAGT QGGAACAGAA 
65001 AACATTCCAG TTCAGGGAAA GCATGTGTGA AGACACPGAG GGV3QCACCA 
65051 ACATCGrcrA TTTAAGGAGC TGAGAGACAG TOVreGCTCT AGAGAAAAAC 
65101 AOWVGTAGT GAACTACACE TTtCrPGTUr ATTCTCTCAT TTCACCATCA 
65151 TAACCATCrr GGGGATQGGA ATACTAACAT TATCCCCATT TTTCAGATGA 
65201 GCAACrOGQG CAGAGAGAAT TTAAGTAACT CCCACAAGAT TATACCTUTG 
65251 GTAAATAGre GGACFGAAAT TCAGACACAT GCAGTCTCAT TCTAACCCTC 

65301 crcrcTCCCA gctctgatcc agaactttgc atcactgata cggctgatag 

65351 ATTTGTCTATG GCTGATAGAC TGrCATTTCT GAGCTAAAAG TCTGATCATT 
65401 TTACATGrcr TCAGAOVTCr TTCCAGCCrT TCGGTUTCAG TTCCAAAGIT 
65451 GTTAGreQGA ATTTCAAAGC CTTTAATAAT CTAGCCCCAC 1 1 IGI IG^CT 
65501 CrCTCTCTAA TAACXAOVTA CAACAATPGG CTQCATCrcC ATAQCACATC 
65551 GTACrCCrCC CGTTCrCTTC GTPCTGCCAG CAACACTQGT TTTCGCTTTC 
65601 TOTCCrGGT TGrTCAGGTC ATTTCCAAGG CCCAGGTCTT TbIGLI II II 
65651 CCCAAGCTTC CCAGAGCTTC TTCCATACTC CCCTTACTTC CTGAGATTTA 
65701 ACTCTTCrcr CrrCAQGGCr TGrCTAGTAA GAAQGAQGCA GCAGCAGCAC 
65751 TCTOGGGTOG TOGAAAGTCT ACCAQCTTTC GAGTCAGACC ATTQGATCTC 
65801* AGGCCTACCA TnTCTACTT AGAI I I I I II. AQGACAAATT TCrCCATOT 
. 65851 TCTAAGCCrC CAATTGCrCA CTTACAAAAT TCATATAAG^ TTTACCTTCC 
65901 AAGATTGGTA TTSGAAGGTAA TTAACCCAGr ATTTAGAACA TAGTAATTAA 
65951 TAAATAACTA TTATTACGXT CATTACTATA GTTAGGACAC TCACTCTTAG 
66001 GTQCTATACA AAGAQGATDV TAAAAQQGAT GrfOTCTTQG GLMCIKiGA 
66051 ATAAATCTPG TCCTTTTACT GTATTTTAGA ATATOVTTCT GGGTCATAAT 
66101 IGI I IGI IGT CATAATAATG AAACATACTT GAATATTAAA TTACCCTCrr 
66151 TTTTTATTTT TTAGCCATGT TAGAAGGTTC CCCACAGCTC AATATQGrPG 

66201 GCcrcrrrcG aggaattatt tccaaagaag gaataccaqg actttao«3a 

66251 GGCATCACCC CAAAOTCAT GAAQGrGCTC CCIGCTGIAG GOVTCAGTTA 
66301 TCTGGTTTAT GAAAATATGA AGCAAACnT AG6AGTAACG CAGAAATGAT 
66351 GrnSCATTTT IIGLIIIAGC CTGATAATTG AAACTTtCAA CAATCTCTCG 
66401 AGreACTTTT TCTOCTOGAA TTGAAACAAG TCTATQGCAA AAGAAGCTGC 
66451 All I II I ICA CAAAAQQGAA GATQGTAACA ATQGTCACTT CAAALI I I IG 
66501 GQCrAAATTA TATGTAOXCA GAAATGTTCA AAATCATAGT TTTAATGrcr 
66551 TTTGAAAAGG CCACACAATT ATACTTTATC II MCI lAAT AATCCTGCAA 
66601 ATCrCTGCGC TGAATCC3GAA ATCTGAAAAT GTACTOQCTT GAAOVAAATT 
66651 IGIIIIGIGI GrrAGAOTTA TAAATCATTA ATCTTTATTT CQQGTOGTTT 



FIGURE 3W 



Docket No.: CL001103CON 
Serial No.: To Be Aligned ' 
^ Inventors: Gennady M^RKULOV tal. 

Title: ISOLATED HUMAN TRANSPORTER... 

66701 ACGnTATQC CAGTTCCTTT ATATTTAAAT TTCI IGI I I I ATATATTTTG 
66751 AAIGICI I lA TAGAI MCI I TAAATTTCCT TATAGAACCA TTAATAGAAA 
66801 ATCATTACAT TTAAAATATA CCTTAGAQCA AAAGCATCCA AATAAGTATA 
66851 QOGnTATCT CaTATTTTT OTnCAGCre AATACGAATG AGCACAGTQG 
. 66901 TQGAATTTCr GAAQQGAAGT GATGAAATTA TATTTATFTC AGTCGGCACT 
66951 TTTCCATTTT ACOVCTCTAC CATTATTTTaG tTCCr3GAOT TATACACTAA 

67001 TrrrcAGrAT ATTAcrxjrrA aattaccaac acaaqgcaat ttatttgaaa 
67051 GATTCOGnr ATCcracovr TQcrrreAAA agovgcaqga aacgaaatcc 

67101 TTTCACrTCT ATCAGCmCT GCAGAGCATC 1 1 101 M ICC TTTOTCCTTT 
67151 GrrrCCTACC TTTTCAATCA GATTCCGTTT TAGTCAQGAA GACTTCTTCG 
67201 GACCATTCTT AdTAACCTGA AAI I ICI 1 1 I TTAATTQCAT GAAGTGGATT 
67251 GATCATGAGC AAATCATCTG OTATTTCTC CCrCACTCTT GAATATCTTT 
67301 GAACTTQCTG TTTTCAATAT GGGONGCACA AAGGTGAGAG ATACATATTA 
67351 ATAGTAGTAT GTATTACrCr TATACATTAG ATACCTATAT TTAAATGAAA 
67401 GGCCCAATTT GTAAACATAT ACATTCATAT TCKTCrrcC CCCAAGTTTT 
67451 AGGAACATGT TAQGATATAG GAGAGTTAAT TTATAATAAT GAGAGCATTT 
67501 TTTTATTTTA CTAAAQCCAT TnTATAGTC AACTATCTTT TCnATITCr 
67551 GTGATTAGAA CTTAGAAAAA TATTTACTAG TrGAAGTTAT TATOVGITTT 
67601 TAATTTAGTT CTTAAACTCA TTTCACTTCT AATAATTTCT GTrATAAATT 
67651 GCGAGCATTT TAATCAAAAT CTAATGATGT AATAGGCATT TTCnTATTT 
67701 GAACCTACCr CTTTTATTTT CfGAACCAAA GAGAAAGATG GACTCGTCTT 
67751 TCTCAAACAT TTTTAAAAAT GTAGTrTCAT TTATATTAGT T AIG I 1 1 G AT 
67801 AAATCTCrOV CTATTTTTAT AATATGATAA GCOnOGGATT CrACTTTTAG 
. 67851 GGTTAI I IGI ACnT TGAGT AATATATAAA GTCACAATAT TAAGGTACAT 
67901 GATCAGCrCr TTCTAII I I I ACTCGTAAAA ATTATCGAAA TGAATAATTT 
67951 TGCTAACAAC TTTCAAATTT 0\AACI ICIG GAAAATATGA AAATATTCAT 
68001 TGITCATTAT GAATTTAAAT TGrAAGGTAT GAATGTGATT IGTCTGTfiCA 

68051 TcrrcrATCT tttcowvaa atgattctct atci i i igga aaaaagccga 

68101 GAGTTGAAGA TAGTATATTT CTTOGTAGTAC TGAATATTTA CTTACAGTTT 
68151 CTATGWV AA TATATATTTG TTTCTAAAAT TACI IGI I I I CCAGIIIIIA 
68201 I 1 1 1 1 1 1 lAG AGAAAATTCr TAAGTCTG^G TTTCCTAATT GAAAAAAAAA 
,68251 AATtAtAAAT AAAQCAAAAA TTGTATCCrA C^GCTTAGCT AGCTTAGATG 
68301 TTTQGCACCA GTrTTCAATCA TGCI I I I lAC AGCreGGTCC ATGrAGrOT 
68351 TCGVAAWTT TTQQGCmrC OTGAQCAGCC OTCTAGATA TnGTCTCTAT 
68401 GATCCAmr GACACAAGGt GATAI I I 1 1 i GTCATATCAA AATtCCACAT 
68451 TTACCCATTA GAGTTACAGC CCrQQGGTTC ACAGTACOVA QGGGGACCCA 
68501 GAGCCrCAGG ATR5GCCAQG CTCATiTreC CGTGGAGTAT CAGI I IGICT 
68551 TGAAATTGTG GGAAAAAATt CrAAGTTGAA TTCACrGGTA AGTAAI II I I 
68601 TAAAATTTCA TAATQCAGAT TACATCCAAA AnTGATTTA AAAATTAAAA 
68651 CATAAGACre CAGAGAAATT CTCCATrTCA ACfCCAATAC TA"nXAGACr 
68701 TCAGAAATAA CTTATCAGTT ATTTOGrAA GCI ICI IGCT TACCTGGATA 
68751 CCTXSACAGGT GAGATGGCTG TAGCAGACAC TGGCAGTTCC CTCCCCACAC 
68801 ACCTGTCCCr GTCCACAGCT GCACAAGGCA GCTCTGTGTG CAATTGQCAG 

68851 CATcrecrcx: TcrcrrcrcA ooGAATcrn: gttagaaaaa tcctgccata 

68901 IIIGIIICIC ACCTATTAGr CrTCTGrCCC AGTCAAGAGA ATAAATTTAT 
68951 GCAAGCAGAG ATTGrACTTT ACAGTATTTT GrCTTTGAGC TTQGCATTAG 
69001 GTTCCATnG TAAAAATGPG GCATQGCTTC CTCATCCCCC AATAGGAACT 
69051 TTGCCAGCCC I I I IGI ICTC ATQGAACTTC CI 1 1 1 1 IGAA AAGAGCACCA 
69101 AAQGAGTAAA AATACTCTOG AQGGAQCAAC CCICCM IGC CATATQCTCT 
69151 CATTCQGAGA CA1?CTQGAGG AGTCTCAAGT OVTTTAQGCC ACTCTCrOGG 
69201 AGAGCACATC CTATCATCTT CrCCCAGCCr AGCCCCTTCC ACTCrGCrCA 
69251 AGTCCAAGCr GACCAGCTTT CTCACCACAG TCTAAACAAA GATGATTCTC 
69301 AGTGGGCGCX: AGAATGCTAT AqCCAGA (SGQ ID NO: 3) 

FEATURES: 
Start: 2132 
Exon: 2132-2314 
mtroh: 2315-17055 
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69000 T G Beyond 0RF(3') 

69134 c . . T . . Beyond 0RF(3') 



Contect:. | 

, DMA 

Position ' 

1722 . . TTSCCCAGQOVGATOGLKjI IGATLI 1 1 ICIGOVVOWVrcOVQGAlLiI I ICICLI 1 1 1 ICi 

TTTTATMTTCCrCCMTAGATCCrrTAQGAmMCTCrCICaCI 1 1 1 IAAAGG\GAATC 

GGCATCCCAGGTCTCCAACCAOGAAAAAATTAGACATCCGTXSAGAGACMT^ 

GGCCCAGTTrC0AGGb\GAGAGAAG(^GCrCn3QGCrGACCGCCAAQGCrC^ 

AGGGTCTTTAAGrraSAGrAAayVGTCrrOW^ 

[G.C.A] 

CGaXK^VQCCCra5ACCTCaX3GQQGCCTCTTCCr(XGACC(^^ 

TGGC>\AaX3QQO\GAQ(n^CGACO:GQQCTCCX5CACAGCA 

QCTCGCTCACrrcCJQQCTXnxnXXy^QQGQQQa^^ , 

<XAGCrccrCCCCA(kyVGCQGCGGGACGGCCACACCCTC^^ 
, • GGCTCrCCGCTCCrQCGCCCTCCGaa:GGCAGCGGCACCCCCGAa3^ . (SEQ ID NO: 35) 

1767 AGrnCTCOrTTTTGrrTTTATAATTXK^^ 

... .TTTTAAAGCAGAATOQCCATCCX^QGrcreOV^ 

GACAATGCCCrcG^TCGCCGAG^rTCCAQGCAGAGAGAAGCAGCTaT3GGCf^ 

* . . GC(^CCGAC3QGGCreAa3CTCCAGC(rraGAlCr^ 
: [C,G,A] 

. . GAGACCOVGCTCCCAGCrCCCTCACTTC0QGCrcrCTX5GAGGCGGGCCO3GCCAGreCa 

CCGAGGGGAGaSCGGCGAGCrCCrCCCCAGCAGCGGCQQGAaSGCOVCACCCrQCGCGCC 

GCGGQQQCrCQGGTTQGGGTGrCCGCrCCTGCGCCCTCCGCGCCGCAGCCGCACCCCCGAC 

GGG5CCCCAAACGCITnTGGQC0GCQQGCCCaK:a^ . (SEQ .ID NO: 36) 

1840 .... TCGCGATCCCAGGTXnXKMCCACGAAAAAATTAGACATCCGTCAGAGA(^^ "n 

ATQQCCCAGlTrCCAQQCAGAGAGAAGCAGCTCTX5QGaXi\C0GCCAAG^ 

AGAGGGICI I IAAGTCGA(jrAA<XAG1XTIX>\AGAC)CCGGC^ 

TCAOXnXSCAGCCCTXjGACCTGCrQQQQGCCTXTrCCTCGGACCCtKATGC^ 

GACTXKKZAACraKK^VGAGGTCGACCCCQQGrCCGCACAGCACCrCCCGAGAeCC^G^ 

[C.G] • 

CAGCTCCCrCACTTCaK3CrcrCTOGAQGCQQGCCGQGCCAGraCC3QCa3AQQ^ 

GGGGAGCTXXTCCCOVGCyVQCQQCQQGACXaQCCACAC^ 

TGGQGTCrCCGCTCCTGCGCCCTXX3GCGCa»VQCCGeACCCCCGAa5GaK:CCCAAAOG 

•' . . CTXjrrGGGCCGGQCGCCCGQCCOVGCCGQGCCraXIGCTCGrCCaSGrcr^ 

. (XCmS^JO^XOSTGfi^^ (SEQ ID NO:37) 

1857 . . OVACCAGGAAAAAATTAGACATCCGTiyVGAGACAA^ 
■r- . ........ CAGAGAGAAGOVGCrCTCQGCTCACaSCCAAaKTCCQGCCCGAGAGGGrCTTrAAGT^ 

AGTAACGAGrcrrCAAGACCCCGCrCCCAAGCCACCGACGCGCrGAOKTCCAGCCGTXjG . 

ACCrecraQQQGCCrOTCCTCQGACCCQCATCCTGACAGCQGGACTGGC^ 

AGGrOGACCCGQQ(n'CC3QCyVOVGCA(Xro:OGAGACOC»GCTCCCA^ 

[T.G] 

GCTCTCTQGAQQCGQGCCCQGCCAGTGCCGCGGAGGCCAGCGCGGCGAGCrCCTCCCCAG 

CAGGQGGQQGA0QGC(^CACCaTKXjCGCaK3QCQGGCrCGGGraQQOTCrCGGCTCC^ 

OKICCreGGCQCCQOVGCaXTVCCCOeXSAaSQaKICCOV^ 

COGCCCAGCCGQGCCTCDGCGCTXSGTCCaiGrCTCGCCCOGCAGCCaXIG^^ 

crrccrcQGCc^QGCOGCcrecGCCTCTGQGACCATGrnRGCGaGGcr^^ (SEQ ID no: 38) 

1945 . CAAQQCrCd3QCCGGAGAGQGr(TrTAAGraGAGTAACCAGrCTTCAAGACC0(^^ 
AAGC(^C)GAOQGQCTCAaKnXXyVQGCCTQGACXnXK^ 
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GCATGCTCACyVGOQQGACreGCMCnX^ 

. CCGGAGACCOVGCrCCCAGCTCCCTC^Cn-CCDQGCrcraX^ 

CCGCOGAQGCOVGajGQGaSAGCTCCrCCCCAGO^KBGCjQ^ 

[G.T]. 

. .. . CCGCCmXXTCGQCrraSQGTCTCCGCrCCKmiCC^^ 

ACQGaK:CCCAAAC!GCrCTTCC)GCdQCGC3GC^ " " - 

. GCTOU3CCC03:AGCCCTCGATCTCCCGT^GACTrCC^CGGCO^GGCCGC(^ 

. QQGAOD\TTmX5aKnX3GCreC!QQGA(TrCXSre^ 

GAGOVGGGGAGGGGCTAOGAGACXXICI ICC>GGCACrGGACX]GCAATGGGGAlGGGAGTX3 (SEQ ID NO: 39) 

2007 . . . GCCACCGAGQC3GCTGACGGTX3(:AGCCCraGACCnnGCreGQQGCCrOT 
ATCXnT^<:AGC3QQGACrGGCAACraQGCAGAQOTCGACCCCGGGrC0G(:AO^ 

a3AGACC(:AGcrcccA(CTCCcrcA(xrcc3C3Gcrurcixx3AGa 

GCGGAGGCCAGa3GQGCGAGCTCCrOCCO«K>«KjQGCaa^ 

CGa5CQQQCTO3Q(JTmKnCTOa5Crccra^^ 

. .. [A.C], 

, QG03000O:^fiJ^aXJGrr^^ 

TOXDGCCCCG^GCCCraSATCrCCCDCTWCTfCCrClGGCCAGGCCGCC^ 

, . . . GACCATGrrxmrOGCTGCGGGACrrCGlTKnTKICCACCCKIGGCGR^ 

GCAO:CGACGGGCTACGAGACCCTaTCO\GGC:ACreGACC(K:>\ATQQQGACQ^ 

QGACATOKaCXSAQCreCAQGAQQQGCrCAQGAACaXK^ . (SEQ ID NO: 40) - 

2769 . TO3QGCCGCX5ACCQQCGACCCCGGrAAO«3AAGreC5^ ' 
. , ATTTCrCCAGATAAAATGAblGI IblGGACACrCTQCKICCACaiGCACrcrrAAATTTTT 
... AAGACACI I I ICjICCTCAATCCATCCCAQbl ILI I Itil l I K-ldl I I lAATACOTGCAG 

ACATOTAATCGGrrTTAQCTtjTOVGACrrOVtnXi^ 

CAC^TTCGATCTXTTTOGAAG L I CiC I I I bl l A OVGCAGCrATgTXn A TTgT OAL I GI 1 1 

........... [C,G] 

...... AAAACrcrrTCAAAACCAATCQCGIGI I ICGCCCACTTCCIGI IGAGAAQGAATQGCQGC 

ATTCCATTGTrTAAGACATTCCrAGOTTAATGCCCTAajrAC^^^ 

. : TTGACTRy^CCimSACTGAGCAATTTCATTTrCTC^ . 
AACTIOKCCaTTAGTAGGGTCGAGATATGrGGAACrrCTCCAA<X 

TCCCny^CACreQCATTCTWATCrAAAGAQQG^^ (SEQ ID N0:41) 

3664 . GCTCATTGTCiXAGAAATQQCCC^ 

. AGCAGCCCAGGAAAQQGACGACCrracreCAGlTKArCAGCAGA 

TAGAGAGRXW^CTCAACTGTXJrrCCrCACAGrAGGTXSCCriT^ 

GTACMCTCCATCCTCCCTACAATATACAAAAGCrCrnXjGAGTXCT^ ( 1 1 lAA . 

GATTXnAAAQQGATCCreAGATOWWVGCTTCAGAATTG^ 1 1 1 I A 

[C.T] . 

OTAAC RKyO'C ArATTLIGi lATAIGM IfalGTCATAGrATATGTTACCAATrLI 1 1 IIA , 

:\ . . AAT<^CCrrrrACTITATTGATAGrrrAAAAAay\T7UrAAGrGAAATTGC^^ 

... CLI I IGlATTCATTTTCTCATrCTWCCAGrrACnTCGrAGGATAAAII I ItiAQGAGT 

. QGA<^TTCCRy\GrCTCAAQOTAA0^OCATm^ 
AAACCTTAGACCCATnTC>VCR.1 1 1 KaACTCAO^blGLI ICTCCACATCCrCiGCr (SEQ ID NO: 42) 

3827 , GAAGGGAGA TCrCAG RaJTACAACTCCATOJrCCCrACAA^ 

TXKTCMTGATrrlTAAGATTXJrAAAQQGATCatSAGATCAA^ 

OXnATCACO^TTTmCGTAACnXK^TCATATTaXjn^^ 

TCTTAOIAATTCrTTTTAAATC^CCriTrACm^ 

TGAAATTCCAATOGATCTO-I I IGlATTOVTmcrCATTCran^CX^VGmCTrTCJCTA 
[G.A] 

GATAAATnTCAQGAGreGACATTCC I GAGrrUPGAAQCTAAOVCAO^TnTAMCraQGA 

TAOrrATTTKiCnTOQGAAACCTrAGACCCATTTTCACTLI I I IGACnGACAbiW.! IGC 

. TrCrCCA OVrCC rCGCrCATTQVGGCTATCAGrLi I IGlAAAGtcrCCTATTCroCAGCr 

. . GAAATTCCmTCATrTCCIblLI lAGTCCATTTAGIGI lUTATAGTCGAATATCTCAG 
AO\QQCTAATTTATAAAGAAAAGACATTTATTTAGCrOWj(VGTTC . (SEQ ID NO: 43) 
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4113 . . CAGTTACnTGGTAGGATAAAl I I IGftOGAGreGACATreCTCAOTCTCAAGGTMCACA 
CATTTTAMaT5QGATAC3CnATTXX:criTCGGAMCCrT^ 
ACreACAGTXKTnQCTTCrCCACATCCrCGCTOVT^^ 

TCCTATTCTCCAQCTGAAATTCLI 1 1 ItATTTCCItlCI lAGTCCAnTAblCjl IbCTAT 
AGTTKWVTATGreAGAOmn>VATrrATAAAGAAAAG^^ 
[C.T] ^ - - - - 

GCAGQCrQQGAAGmAAGAAGGGTCGTTKTCGCATCTX^^ 
TCCI GC I G I G I CA(^CATGGnxyWVGTOW\GTCGAAGTXdGACA llj I G I'GAAGAAGCA 
AAATC0GAQQQCnx;TmT3GCTTTATAGO\ACX:O«^ 
GAGGGAACTAATTCAGTCTCATCAGAGAGAGAACrCACTCACrACrc^ 
AAGCCATTCATX5AQGGATCTXKCTC0GTAACCCKy\CACCTCCre^^ 

4337 . . CATTTAGIGI IGCTATAGreGAATATOGAGACAGGGTAATTTATAAAGAAAAGA^^ 
ATTTA^XlX>^O^G^^CClGCAQGCTCQGAAGlTTAAGAAGa 

ACTCCraQQGAQQGCTTTCCrGaG'l GTCACAACATQGTGGAAAGrCAAAGTXiGAAGPGG 
ACATCnCTGAAGAAGCAAAATCGGAQQQGrcTX:CTX5GCTTT^^^ 
GGAACreATCCATrACnyVQQGAACTAATTCA^^ 
[A.G] 

aXK:AAGAATGACACCAAGC(^TTCATGAQGGATGrGCCTCCGrAACCCn^<^^ 
GCTAQGTCCCrCCrCCOVAbXCQGCCACATCAQGGATCAGACTTCAACATGAGI I I I IGI 
QQQGADVVACAAAACGrAGCALI IGLI I IGCLI II IGGn^CTATTCACATCCrcCACAQG 
ATTCCATTATGCCrACCiC A l I I GG I G AGGGOVST CI lU I lAAl IGGI I l A iCTGATTCAA 
ATCCTACjCCrOCrCXAGAGAOVrcGT^VCA!^^ 

4473 ... TTCCIGL I G IG I CACAACATGGTGGAAAGTCAAAGTCGAAGreGAG^TXnxnGAAGAAGC 
AAAATCCGAGGQGTXn'CCTCGCTrTATAGOWZCCAGCCTCGAQQGAAC^ 
TX3AQQGAACrAATTCAGTCT(^TG/\GAGAGAGAACT 
CAAGCCATTCAreAGGGATCTCCCTCCXJrAACCaxy^CACCrCCTXXTA^ 
CCAACAOGGCG^CATCAQQGATCAGACrrcAACATGAGI 1 1 1 IGIGQQGACAAACAAAAC 
[G.A], 

TAGOVLI IGLIIIGCX-IIIIGGI ICTATTCACATCCTCGVCWiGATTCt^TTATGCCrAC 
CCAI I IGGIG/«QGCAGR.I ICI 1 1 AAI IGGI I lACreAtTCAAATSCTACCCrCCrCCA 
GAGACATCCnCACAGACACACCCAGAAATOMGI I I lACOVCTTATCTGGGCATCGaTA 
GTCCAGACGAGTPGATACATAAAATTAACCATOXCACATGGGATAGAATTA^ 
ACTOXACmTATQQGAGAAAATTTCAGAQQCATCrcAia^ 



(SEQ ID NO:44) 



(SEQ ID, NO: 45). 



(SEQ lb N0:46). 



6455 . TGTTTATnK:ATTGAGTX3GAATCAQGATTTCACrCCAT^ 

AGAGGGTTCATTTCAI I I I lATTTCATTAATATTGCI I I I I 1 1 1 1 I I I I I IICIGGAGAC 
AGAATCnTGCTCTATGACCAAQGCTQGAGTXK^VGl^^ 
TLIGLI l<JLIGGATTCAAGCGAmi IGIGCCTOVCjCCTXZCONAGOV^^ 
GCACATXSCCftCXACACGIGGI lAALI 1 1 IGIATTTTOVVCTAGAGATCQGATTTTGC3CAT 
[T.G] 

TTGGrCAGGarSGTCTTGAATTCCTQGCCTCTAGn^^ 
GrcCTAAGAtrACAQGCATCAQCTAOCATQGCCyyQCCCAT^ 
OVGACATCmTOGTTTXnXjGCACAATATTAAGAAGA^^ 
ATTTTAGGGCATCAG^ACAGAAAGATTATOJ^ATAAGAAAAACAATCGA^ 
ATTrOCTCAAATCrrCrAAAATATATAAAATCreTATC! I liGIGI I CTCTCCTGATTT 

6533 TTATTTCATTAATAI IGLI i 1 1 I i 1 1 1 1 1 I M I ILIGGAGACAGAATLI IGLIGIATOC 

CAAGQCreGAGTXXj^GTlQGreClGATCTOSQCT^ 

GCGATTCI IGlGCCTCAQCCTCCCAAGCAGCTXSAGATrAO^GGCACATGCCACC^C^CCr 
QGrrAACI 1 1 IGlATrrrCTAGTAGAGATCQGATTTTQGG^TGrP^ 
GAATTCanQG0CTCTAGTWTXniX3CTTX:CTOGC^ 
[T.G.A] 

TGAGCTACCATCQCCAGCCC^TTrcaTAATATTTTAATT^ 
TQGCACAATAT^AAGAAGACA^GATATGAAATG^(:AGGGTGAATT^TAQGQ:ATC:ACM 
AGAAAGATTATCGTATAAGAAAAACAATCGAATTCXAACTAC^^ 
AAAATATATAAAATCTXTTAILI 1 1 IGIGI I GTCTGCTXy^TrTATATTCTAAATrreATOT 



(SEQ ID NO: 47) 
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TATCCTTCTtnrK^GAAATAAAGTXnOtWkAGMTCAAAAA^ (SEQ ID NO: 48) 



ATCAAATCACAQQCniGMTnTAQQQCAT0^OW^<^^ 

MTQGAATTCCMCrACAl I ICIljICAAATGTTCrAAAATATATAAAATCreTAILI I 1 1. 

GTUnUTCTCCrcAimTATTCTAM^ 

TCTXiAAAGAA7Ty\MAAAATQGAA^ 

TTTCTAQCATTCrAACKXTITnTO^CCT 

[G,C] 

TAQQCX>VOVQC]CATTnA(^tTGATOXnTAI 1 1 ICI ICTCrCTCCtTCAGAI I ICICIC 
ATTCCCCCTTCrcnX:crQGrATATGATTGCCCAI lb 1 1 lAAQGCCCCAACTCACCrTTA 
TAATiCTTCCrAGCCCALI I ICI I lATCQOTAmCAGAAAAAACAAAAGAAGaTCCACA 
AGACAACATTCTCTAATACALKaLI lAACI ICI 1 1 ItaAOCCTXSaxyVGTTCAAAAATGn: 
ATCrrTTTAAQGATTGAATOSAGTCOVCC^^ 

GATTCCCOM lb II lAAGGCCCCAACrCACCriTATAATCrTCCTAGCCCACI I ICI MA 

TakrrATTCCAGAAAAAAO\AAAGAAGCTrCCACAAGACAACA^ 

TAACI ICI 1 1 IGACCXnXXnGAGTTCyVAAAATGTATCI 1 1 1 lAAQGATTCAATQGAGTC 

CACCAAQGTATCrATATrTCACAQGATTTAlXWWVO 

GAAGCCTAACtcrcAAACXnXSGATCATAbibl I lACTACACATrAACibl I I lAGTXsGAT 
[G,A] 

TAATAGmTTATTATAQGaCTTQGAATCAGAAOVQQGTTCAAATC^^ 

TAGAaxnGQClCTnQQG(XTXnTAmAATQCXTCGA^ 

QCTAAGACCTAGCCAGTAACTrAQCATAAATAGrAAATn^TTCATTTAAT^^ 

CAGTXXlCAGACATTGrrTTAATGAACraiQCyV^^^ 

mATTCTATTXntAAAACCaXXCTAT^ 



(SEQ ID NO: 49) 



(SEQ ID NO: 50) 



TAATCrrcCTAGCCXIACI I ICI I IATCQG^ATrcO^GAAAAAACAAAAGAAQatC3CACA 
AGACAACATTCrcrAATACACICCi lAACI ICI! I IGACCCranTGAGTTCAAAAATCTT 
ATCI I I I lAAQGATTGAATQGAGrCCACCAAQGTATCTATATTTGACftGGATTTATGAM 
ACAAAAQGATTTCrrxyVGAAAGrnXyVAGCCTAACTCre^ 
CTACACATTAACIbl 1 1 lAGTOGATCTAATAGmTTATTATAGQCTCraGAATC^ 
[A,G] 

GGGrrCAAATGrrrrCACCGCnQCTAGACrcTGG^ 
GAQGCCrCAAATXJrrMcrAQGAATQGTAAGACCrACCO^JTAAa^^ 
AATnCATTCATTTA AICil 1 1 I CA AAOVGrGCCAGAC A l I bl I l A ATCAACTQGGGATATA 
GrGGTGAACAACACTCACAGail ICI ICATTCTATTCTCAAAACCCrcCCrATAGTAAGr 
AGGTCTGrGTUrGTUTUrAQ3TX3CATCQGGAATAAAAAATAAT^ 

TTAAQGATTGAATOGACna^CCAAGGTATCrAT^^ 
QGATTTUrnGAGAAAGrrTCAAGCCTAACTCTGAAACG^^ 

ATTAACIbl 1 1 lAGTGGAlTnAATAGrrATTATTATAGGCTXJTGGAATCAGAACAGQGr^ 
CAAATCrrTTO^CC3QGrrGCTAGAaxnXK3C<Tra^ 
CrCAAATCnTAACTAQGMTCCnVVAGACXrrAC^^ 
CA,G] 

TrcATTTAAIbl 1 1 ICAAACAGrSCCWSACAl Ibl I lAATGAACTOGOGATATAGTCGrG 
AACAACACTGAOVGCGI ICI IGVrrcrATTCrCAAAACCCrCCCTATAGTAAGrAGGrcr 
GTGTXnxnUKnAGGTGCATCGGGAATAAAAAATAATAAGCAAATAATGAACAGGGr^^ 
TTGAAAAAGCAGAAAGAGCTATTG^ACAAAACTACaiGCCTrmTTAG^^ 

McrcrATQ(nTrcTTcrcrccnn"(^TTCTCTTAAA^^ 



(SEQ ID no: 51) 



(SEQ ID no: 52) 



7589 . . AACIGI II lAGTCGATCrAATAGTTATTATTATAGGCrcraGAATCA^ 

ATCrrrrCACGGCT7GCTA6ACTCTQGCaTX3QG(^T^ 

AAATCTTAACTAQGftATOCnAAGACOACC^^ 

. CATTTAAIGI 1 1 1 CAAACAGTGCCAGACA I IGI I lAATCAAOXSGGGATATAGrnXSTCAA 

CAAOVCTGACAGGbl ICI ICATTGTATrGrOWV^CCCTCCCrATAGrAAGrAQGrCTGr 
. ■ [G,C] 

. TXnGTGTXnAQGTGCATQQQGAATAAAAAATAATAAQaWVTM^ 
AAAAAQCAGAAAGWGCTATTtAAO^AAACrACaXXICrr^^ 
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TCTATGCil II bl ICTCTCC TOTCAA TTOUrrAMTGaiGTCAGCLIGi I I ICCTTATCA 

CCCPQGCCACGALi ILUjILIIIICIIsLI I (iGTCCTUTAGACrCTMCCCAAQQCTCATT 

CrCTQCCTOGCTATCreCLI ILKjIGQCraTPGCCACrACCrACAi 1 1 ICIGIGI l(aCA (SEQ ID NO: 53) 

781D . _ . . CreOGGATATAG I GG ITiAAOVACACTCACAGCX, I 1 1 CATTCrATTCTCAAAACCCTCC 
CTATAGtAAGtAQGtGrcKnxnxnX^ ~ 

ATAATCAAOVOGOTAATTTCAAAAAGCAGAAAGAGCTA^ 

TATTAGATGAAACTCrCAACrCTATCGI I IGI ICTCrCCnjTCAAl ILIGI lAAATQCTG 

TCAGCLIGI 1 1 ILXTTATCACrcreGCXIACXSACrrCrGrLI 1 1 ICIW.I IWjILCIGIAG 

[A.C] 

CrCTAACCCAAQGCTCATTCrCTQCCraGCTATGreCC II C I G I GGCrCTTPQCCACrAC 

... CTAC AI I I I C I G I GI I G QVCAGGGAAGGACCATTCCXnnTiGACCATAAA A l I tlU 1 1 1 

TCAAAGAATTGATTCTTCATTTmrCACAGOACAT^^ 

CCACnGCrCAGONGCrcrGGGGGAAAAIGI I lACTGAGAAGGGTACAGTAGI I 1 1 1 1 IGA 

CrAAO^TQGPQ(^CCTCmx:CAGAQQGAAACCTATGAOT^^ . (SEQ ID NO: 54) 

9104 TTAAAaSAATTATTCTAGAAACAGAAAAAOVAATAC I G I G 1 1 C I CATTtACAQGQQGAQC 

, ...... TAAACCrroGGTAAATQGGGCATAAAGATCGGAACAATAGACACr^^ • 

GGQGAGGGAGGGAGGAGGGCAAQQGCTGGAAAGCTTCCTACrGGGTALI I IGI ICACAAC 

.... CTGQGTCATGGCAGGATTAGGAGCrCAAACCCCAGrATCAC^C^^ 
AGCreATOrrcrAACGCCIXWVTCTAOAAT^^ 

[G,^^ - 

, . GGAI 1 1 I lAAAAAGAAQGATTXZCTAGACAGGTGCAGCCAAACAAl I I I I I I I AAATGTTG 

GG^GGCCGGGACCGCC:AGTCACrrATOnTGGV\TAGCC(:ATCn-CCCAACAT^^ 

ACTTCrCTCGAAAAGAGAAGCTATACTITCAGATGGCCaiGrreC^^ 

GrrTCTXX5QGAAAQQGGCTreAGrnQQCCX3GAaX5GACrCrrCCr^^ 

GLI IL!GATOVGAC3GTCAGny\QQCAGGAACrcCXX3Q6TCl^ (SEQ ID NO: 55). 

9503 CATGTCCOV\CATTCCCMCCTACnO"CrCCAAAAGAGAAGCrAT^^ 

CIGIGCIGGGI ICICC3CrGGAAGI I ICIGGGGAAAGGGGCmiAGTTCC)CX:GGACTCGAC 

TCTTCCTCGAGreGGAQCOGQGGLI ICIGATOAGAOGTCAGTGAGGOVGGAACTCCGCGG 

TCTCOIAGCXX^GCCCAGAGrQCQGrCCCACGCAQGrCCCGGGrcaX^ 

. : 

[A,T] • 

...... I I GIG I I I G GOGOXiCreASGAGGA-re CIG I LI lA GGCCTmtXIQOXSGAOGrGrgre 

. GTQGGCAGAGATCCastTCGrcGGTaXACTTCCACCCCGCnXjGGCT 

QGAGCreaSAQQGAGAGATCCTCGATWiACrCCCrCrACGGAGATCILI I I IGGTACCTG 

GAiCTATAACAAGGATGGGACCTTQGACAl I I 1 1 GAGCTrCAGGAAGGCCTXSGAGGATGrA 

. . QQQGCCATTCAATCrcrAGAQGAAGOaVtfSGTOGCTCTC^ (SEQ ID NO: 56). 

9898 . . . , ACCCClGCTGGGGCrCACrCAQGCCGGQGAGCTGCGAGGGAGACATCCraSAT^ 

TCTAClQGAGATCrLI I I IGGTACOXiGACTATAACAAQGATCGGACCrrQGACAl II I IG 

AGCrrCAQGAAQGCCTQGAGGATGTAQGGGCG^TTGAATCT 

GrCTCACTXSQQQCTCrAATCAGAGAGAajnxmXTCGGAQCECT^^ 

CAGAGAGGGCAAAATTTAC A I G H G I U \AGCtTCACCTm5CC<^aGCAGTX;rrCAGOT 

[G,C] 

GrrxyVCCAGCGTTACCGrrrATTAAGAATAACAACACAGCrAAC^CAT^ 

TTTCrcarrmCTCCrrOKnTnAGrAAAATCTCCAACTTCAG^ 

TGGaACATAO^GCCTTGTCrTAGGAGrCACLI IGI I OVATCTXKTCACCreTCATTAGr 

CACCCAGAGGQQCGrCTAGGCTAAAGATTKGCCCrGCCCAGlTCAGAGAAGraG^ 

CAOCTAarnGrATTTCQGACTXmniQGreATT^^ . (SEQ ID NO: 57) 

■ • • > 

10196 GTOGTTCAaiAGCGnACGGTTrATTAAGAAW 

\ Al I I I ICrCCGrrTTCrCCTTCGatnTNGTAAAATCTCCAAaTCAGATTTG^ 

TOTTQGCTACATACAGCCI IGICI lAGGAGTCACCI IGI I CAATGTGCTCACCTXjrCATt 
. . AGTO^CCCAGAGGGGClGraVVGGCTAAAGATCCGCCCTCCCCAGrrGAGAGAAC^^ 

AATO\CTCTAC3GnrrATTTQQGAGTOQQCnX5GTC^^ 

. . [T.C] 
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12327 



13749 



14150 



14529 



14653 



amTOGrrCCnXSAAGQQGGCAGTQGMGTGQCrriTAaCT 

TCAQGriTCCTCATAATAlXXICrrAATTWAGACCCrAGrrATCAG^^ 

CTAACCCI ICICI ICCCO^GAAQQCTAACCrACAQGCTCCrTCrCAQCAItjl IGIW-I IC 

GrA(^TACTXCTATTGOVGTA7TTa>\AGr(^Tn^ 

TMTAATTAqTTAT/ywr^^^ 

GrcATCrrATTTAATCCaXSGAQQCCTCAAAlW 

AGTAACrrAGC^TAAATAOTAMTTO^TTCATTrAAItl 1 1 1 CAAACAGTX3CCAGACATT 
GmAATGAACTXSQQGATATAETOGTCAAOVACACn^ 
AAAACCCrCCaATAGTAAOTAQGnrCRnxnXjnnXJTX^^ 
'TAATAAGG^AATAATCAAO^ATAAAATTAT^mTTTAAAAAA^AAGAM^^^ 

[C.G.A] 

TPCn'ajKsl lAAGATACAAAAGGVOAACI I 1 1 lATTGrGAAAATAGTLIGI I 1 1 IGAAC 
AATAT A I I G I 1 1 I(jM 1 1 1 IC CTCTCAAAGnTGAGAAACTAAATATACGAAGAGATAATG 
GTdVGACCATAAATAAAAATAGAACriTCACTCAAAATTTAG^GCAGT^^ 
CCAGCCCrmTCTAAAATAMCAGACCAQGAAACONGCaXnTAT^ 
AAGTCAQCTTtCTATCTCTAGAGA<^TAO\CAA^ 



(SEQ ID NO: 58) 



(SEQ ID NO: 59) 



TACAQGCGTCAGCCACCATCCGCCCAQCCATAGACTATATAI I I I KaATCTGATAACTGG 

TTCAQCTACTAAGTCACTAAGAGGCAAlJrAGCATCrATAGTG^^ 

QGAC^TTC^CCrCXRSQQOVQGATQQCACAGAAT^^ 

GAATCCnxntjCAAtTTAAAACTTATGAGn^ 

T(^GACCATGGATTCACnX>VGGrAACrcAAACnn^^ 

[G,A] ;. 

GGACrATTUTATTGrrAAGTCAGACTCATTAGGCMTCATAACTCTn^^ 

AMTTjCrcCAGAAATATljQGrrAAAAAAAACnnTCAAA^ 

TAACTTCITACrilCCAAAATOTTAGreAAM 

TGACTAAGAGAAAATCI Ibl I I ICAGGATXSAO^GATTAAAAAAGAAGCAACTTCCTTGAAA 
CACTXWWVTCrCrCCACrrcrAAGATAACACAAAACraGCr^^ 



(SEQ ID NO: 60) 



ATAQOGTCAQOGATCrCCTTTAAU IGI IACTTX:OW\ATCnAGrcAAAACnGreQO:C 
CAAAGACTGAAAQGAACAMTCACTAAGAGAAMTCnXJl^^ 

AAAGMGC^CrrCXniSAMCACTlWWVTCTCTCCACTW » . ■ 

CTAAAAL I «3 1 1 QGMTGAATATOKIOVACrO^AGTCTQCACAGAACT^ 
moVG(XCAAATTTCO\CCA(^TATmATACrAAC^ 
[T.G] 

CreTO^QGTAGCATGAAGAQGTAACTATGCATTKCTAAQGACnT^^ 
TCCrrCCACCMTCACCCACrAATCCO^GAATCCGGCCCCAMCCriTTCrAATAACrAC 
OTAA^ONGG^TAGGGAGACAGATTTCAGaXXyXCrCC^ 
TOT^TAAAAAQCTTTTCTTTTOXIAAC^ 

GQQQOVQCAAGCCn_l 1 1 IGGrQQGreACTATTCntjl I GQCTWATITCCArnQGCOV (SEQ ID NO: 61) 

ACTAATCCCAGAATCCGGCCCCAAACCrrTTCrAATAACTACaTAAAGCG^Q^^^ 
AGACAGATniGAGCrQGACrCCIGILI ICI IC3l(jQGTCA(jCTl«:V\TAAAAAQCrnTC 
TTTTCTCAACACCnKnVVTrATAGrATT^ 

TCGTGGGTGACrATTLI IGI ICGCTGATATTTCG^T^QGCCAAAATATAAACC^CTTAGA 
TCAAACrrCAGrACGrAAATCGaa:C^CAGMTOnXJTTGACAI I I I ICrcmOGATTATA 
[G.A] 

CAQGrrACTTTAaXWVTACCOTAQGCAGnATAACACACT^ 
CATAGAAAAGATAOVGrAAAAATATOGTAAl 1 1 1 1 1 IO\ACTTTTAGlTXy«3ATTTQGAG 
GGTATUreOkCATTTXjn'ACAAQQGTATATTCC^^^ 

AACCCTXn"0^CCO\QCTAGTX3AGCATAGrAC(XAATCGATAAI 1 1 1 1 CAACCCTTCTCCA 
TPCCCTXDCCCJbl ICI IGIAGFCCGOVIjI I ICIGLI 1 1 ICGCATCTTTATATCXXTrcTCCA 



(SEQ ID NO: 62) 



CrCMCACCTOGTATTATAGTATTGACTTCrAGrrCATCGGGCAGCAAGCCCC I I I I GGT 
OGGrGACTATTCI IGI ia5CTCATATTrG(^TraK:CAAAATATAAACCrCTTAGATCAA 
ACnX^VGTACGTAAATlKiGGCCACAG^TQCrcreA<^^ 
GTTACnTACTCAATACXXnTVQGO^EirrATAAC^ 
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AGAAAAGATACAGTAAAAATATOGTAAI I 1 1 II I CAACTTTTAGTTGAGATrPQGAQQGT 

[G.A] 

TlJRXTVCATnurrACAAQQCTATATTTKIATCATOn^ 

arSTCfiCCCAEGrAGrGN3CAJ/^ACC<M^ 

, . CrCCCCGI ICI IblAGTCCCCAGI I ICHjCI 1 1 I CCCATCnTATAT CCGTG'nKACCCC 

ATGrrTTCCTCCCATC^ 

ATTQQCrrAGGATAATOSGCripVQCreCAT^^ 

15871 AQGAGmATONATTTTATTAGTmTTOWVGAACCAILI 1 1 KiQLI I IGI lAATCCTC 

: . CCAATCblGIGI I MCI I ICTCATTALI I 1 1 lUnrnTATTTCCrTCAALI ICI II I II 

. , . GCTTAATTTTAAAATAAI I ICI IGAGATTGAGATAAGCCTCAATGATOGGTCACCGATTT 

CCAGTCTTTCTrCTTTTCTAATTATGO^TTTTAAAGCAGAAATCm 

TTTAGnGOVQCrOVONAmrCAGATXnUrCTCrOVC^^ 

[A.G] 

TGACOVTGAAACCATCCAGrCAO\ATUTGGCATTAI I I I I I IAAI I 1 1 I I I I 1 1 I I I 1 1 I 

TGAGATAGAGTTTCACrCmTTGCCTAGQCITjGrenH^^ 

AGOWXlX3C>\(Xra:OVQGTTO\AGGGAT^^ 

ATTACAQGCATQCGCCACCATGCCCAACrAAl I I IblATmTAGTAGAGATQQGGGTTC 

TCCATCrnQGTGvsGrrGGrcnxw^crcccGACGrcAQc^ . (seq id N0:64) ^ 

19244 . GTCGOVTTAI IGbl lOVTAI 1 1 1 lAI 1 1 1 1 lAGACTTCCTTAATQOWVACATATACAGr 
..... TGATCCTD^mTTTtmy km TCT A TT^^ 

CCAAAGTAACCCOWVVTATATACrCAG^GrACTrTCCi^GGO^TTCATQ^ 

GAGCACTXSAAAAAOTCAGrrGCrCAGCATGTACATTCCrAOT^ 
ACTGTGCCI ICI l(al I (OVGCTOWACTATTAACrAGOVACTATCC^^ 

[G.A] ^ , ^ 

. 1 1 IICK/jOVLiI 1 1 1 IGOVI 1 1 1 IGIAI 1 1 1 IGliGGTAATrFCjCI III lAAAATGTTGC 

: . CGAAAQGTAGraCTTGAAGTXXnXjrCTAGrcrrCCTAAGTGC^^ 

. . TTATGGAGAAAATATATXK]GnX»VTAAGCrrTTGCCCCAAATT(^ 

(^GCACACATTAAATCAGGreCCTTCAAACAGAAACAGACATAAGACATOGT^^ 
i AATCAGrreATCAAAGrcn^^ 

19387 . CrCACAGTACritCCCAGGC^TTG^T^SACATCCACAGAGCAGTXSAAAA^ 

TCAGC^TCTACAmCTAGCrAGrAGAATAAQGOWACTCrGCCI ICI IGI I ICAGCTC 

TtAtACTATTAACTAQONA(nATC<XrrTCAAQC^ 

. , tTGTAI I I I IGI IGGTAAtTTCCI 1 1 I lAAAATC^rTCCCOWVOGTAGPGCTGAAGrXSCr 

GTCTAGTUrrCCTAAGTGCAAGAAAGCOVTAQCATXKICrTATGGAGAAAATAT^^ 

■ [T.G] ; 

. GATAAGCrrTQCCCCAAATrCAATCrrAGreAATO\AG\^ 

TTX:ftAAO\GAAACAGACATAAGACATOGmTCT^^ 

ATCAGAGGCrOVCAGGAACCTAACCCIGI I I 1 1 CCTCTAQGAACAATQGTTTXSGrATTTG 

CTAATTCAGrcrrTQCAATGAATATAGAACrrTATQGAA^^ 

GAATTAACX^TATCTOTTAAGACTXa^TTtCrA^ (SEQ ID NO: 66) 

19447 TCAGCAlOTACATTCCTAGCrAGrAGAATAAGGCAATACT^ ^ ICMG l I IC AGCTC 

TCATAGTATTAACTAGOVAGTATCCCrrrCAAGGrCrATTTTGTGCCAGI I I I IGCATTT 

TTXnAI I I I IGI IGGTAATTTCCI I I I lAAAATGTTCCCCAAAGGrACTGCrGAAGTGCr , 
. GrCrACTGTTCCTAAGTXXAAGAAAGCCATAGCATGCCrrATQGAGAAAATATATGCGTT 

QGATAAQCrTTCC0a^WaTCAATGn7\lGrcAATCAACAQCACACAT^ 

[C.G] 

... TrCAAACAGAAACAGACATAAGACATCGmTCTATTAATCAGrrxyVTGAAAGTCnOT^ 

ATCAGAQGCTCAOVQGAACCrAACCCIGI 1 1 1 1 CCTUrAGGAACAATCGnTQGTATTTG 

. . CTAATTCAGIGI I IGCAATGAATATAGAACTTTATGGAAGATGATRCTCnTyNATAATGA 
. . GAATTAACCATATCTCmAAGAGrcCATn'CrAAAGGAGAATATTCAGAAGQGrATrrc 

CATAAI I ICI I IACTAACAGATGCTXX:crCTCACTGTCCrTACATQGTCCAGATTCrO\T (SEQ ID NO: 67). 

20076 TO^CTXIAGAA TCCTGTC ATCrCCrC CAGGGTC CITraTCCAAGA AAGTCrA TCCTT^ 
. . CACTAACAGTAAI 1 1 IGGTCTTCCICI 1 1 1 ICIGGAGAAGTCAGCIGI 1 lATGCIGCI IC 



(SEQ ID NO: 63) 



(SEQ ID NO: 65) 
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AGCACCAGACCCrcrCTTACI I Ibl 1 1 ICil I ICATTCI 1 1 1 1 CATGrACAOTAGTCTTAG 
GATTOOVre^CTCTCA^^ 

CTATTTTATTTTCTATTrrCTCrrcCICilCI ICIGAI IGI ICTOLI ICIblCCACAAACA 

[T,C] 

.GCTCTAATTTCCCrAGrATTAAAAAl I I ILICILI 1 1 Ibl IGI ILI 1 1 lATCCTTCCrCC 

OTA 1 1 I I lACreCCAGATTTTTAI 1 1 1 rATTTAtFTA H I I IGAGATSGAGTCrCACTC - 
TtJTC^CCCAQGCnmnGCAGraGaXlGATO'OV^ 
CTrCAAGONATTTTCCrCTTTTAGCrrCCCAACTyV^^ 

ATXX:craxnX5ATTTTTCrATTTTTACTAGA^^ (SEQ ID NO: 68) 

20492 oxcrcrcrcAGCCAQQcraQQGTGCACjreGaxiGATax^^ 

CCCAQCrrCA/VGCAATTmCrcrmAGCCTCCCAACnA^^ 
CCACCATXX:CTCGCreATTTTT<TATTTTTAGrAGAG^ 
O^CTXKTCrCTAAOXKrrGACCTCAGGrcAACiCACCCGCCrCAGCC^ 

GATTQCAQGrcreAGravcrcTCcci^^ 

[T.-J 

GIGLI I lAGCTCTAI I ICCICATTTACTALI ILILI I lAACTOVCHlCATATATCAIbl 1 1 
TCCATAGrAAATCTCrAGTAATrnTTAAAAATCTAGAAATA^^ 
AGATCCrACnTAATTXWVmATOTXKSAGTTAGAATATCrT^^ 
TCCTACrrcrrAATTAOVTTACrTGGrAAGGCCACnxnXWW]^ 

ATATTATTTATCTATAA(3GCrcTTACAATTACreAATT^ (SEQ ID NO: 69) 

20868: TAGTAATTTATTAAAAATCnAGAAATAQGTACTTITA^^ 

TGAATTTATCTTGGAGTTAGAATATaTGATTTGGATTrrAGrr^^ ICl lAATT 
ACATTACmSGTAAQGCCACI IGlCaAAGrC^GrCTCrrTCGAQGAATATTATTTATCTAT 
AAQQCTIGTTACAATTACTGAATTTTAAAAAATCICT^^ V 
CATTTrrAGrATTOVTCnXjGGATAGQCATT^^ i 
[T.C] 

AATTTTQCCrrAATCACnTTAAAQCTrrCTCTrAAATCAGAGAT^^ 
ClblbbI ILI lATCAbl ICIGACTTTTAI I I 1 1 IGCGLI I I I lAI I 1 1 1 1 lAAAGGAAAA 
ATTGAGGCTTG^GAAATTXrrCCAGrCrCrCOVGACACreQGr^^ 
0\ACK7\GAGTTCATTCrrOSAAQGrAAG^ 

TTAATATCCnOCyaTAGAACifO^^ (SEQ ID NO: 70) 

20941 . GAGrrAGAATATCrnGATTTrSGATTTTAGI ICKjCTALI ICl lAATTAOVTTACTTQGrA 
AQGCCACTTUTCAAGTCAGTCTCTTraGAGGAATATTATTTATCT^^ 
TTAa^TTTTAAAAAATCrcrATTTA! II II lAATCTAI II G I lACATmTAGTATT 
GATGnXSGGATAGGCATTTAAGCAAGTCrATAACrCACCTACATX^^ 
ATCAGrrTAAAGaTTCTCTTAAATGAGAGATTnG^^ 
[T,C] 

CAGI ICIGAGrriTTAI I I I I ItiCCLI I I I lAI I I I 1 1 lAAAQGAAAAATTGAQGCTTCAG " 
AAATTUrCGVGTCrCrGCAGACACTQQGrCTCACTA NIL! bAACAACAAGCAGAGTTGA 
TTCTTCAAAQGTAAGCtCTrCATCrrQGrCAACAATrGACm 

TTAGAACTCKnxnTnnAAGrcreocrrrAAAACACcrcc^^ - " 

TOTVAGATLI 11 1 IGlLl 1 1 11 ICCTCGCATrcAl 1 1 IGlATGrUTACMTrATCTAAAG (SEQ ID NO: 71) 

•21116 . GrATreATCrraGGATAGGGATTTAAQCAAGTCTATAACrCACCTACATGC^^ 
CCrrAATG^GTTTAAAGCTTTCTCrrAAATGAGAGATTTGAAATTCATAATT^ 
TCrrATGAGrrCTGAOTTTTAI I I II IGCCCI II I lAI I I II I lAAAGGAAAAATTGAQG 
CTTCAGAMTTUrCCAGrcrCTCCAGAO^CreGGrCTlGACTAI I ICIGAACAAOVAQCAG 
AGrreATTOTCAAAGCTAAQLICI ICATCTTOCTCAACAATTCACrTTCACriTAATAT 
[C.T] 

CreCATTAGAALICICilGI I IGlAACTXntXXnTTAAAACACCTCCCTAGTXTrCArrAT 
GTATATCOVAGATLI I I IIGICI 1 1 II ICCrCCCAmATTrnGrA-TCTTCTACATTTATC 
TAAAGrcTAAGAATQQGAAGrcrAAGCTGAGACTGGACrCI I ILI 1 1 CAAQGCCTCAAAG 
GATAGT GGAAT GGCAGGAACTAAGCjI I I lAACTCG^TAGATGAGGAGCTGAAGAGI 1 1 l(j 
GrcrnxrrTTTCrcCATnWTTCTAATTnGACAGTAAAACT (SEQ ID NO: 72) 
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21701 CATTCATTCAAACTMGMGACTAGOVGATTOVT(^CATTAT^^ 

. GAAAAAAGQGAAATTACTAAaTCrCCMGCTMCAMGAMTACaGnTAAA(^^ 
. . GAAAAOVGAAATCCAAATTTGAACCrrATTOTCTCQQQO^^ 

0«3ACrrmTACrCTTAAIlil 1 1 IGI I ICATQQGATAGAGOVCTAATCTCTCCAGCCCA 

GCn^GCrcrCAAATACTLIU IGCrATAAACAOVGGGOVGGAACTGAI 1 1 1 1 lATGATAAC 
• , [G.A] . ~ ' ' " _. ~ • - - 

TAAAAONGAAAAQGACAATTATATrcrATTAATAI Ibl KjIdAATATTTTCAGrCCrCAC 

ATTTntrAAAAATCrrTCTAAATQQLI I IGI lATTXyNATrnTaX^TTTTATATCTCTX; 

OZAAOVQCATTTTXAnxrnxnmDVrAAl I ICI 1 1 lAOWVCAQCTQCrCA/VGAQGA 

AGGCrcAAAGrCTO^AQQCreAGCAaJ^AATGA(.l I I IGI lAGTACTAGATCAGAAQQGC 

tttccreaqgamtcaaaacoaaaacatgaaaagaagataaao^ (seq id no: 73) 

aaactaagaagacta(x:agattw(^cattatttaacc^^ 

GAAATTACTAAGCrcrCO\AGCTAACAAAGAAATACCrcrrTAAACrr^ 
AATQOWVTTTXyVACCmTTUrCTXmSCAATCACJITTX^ 

ATACnCTTAAIbl 1 1 IGI I l(^TQQGATAGAGCAGrAATCTa«^\GCCG^QGrecrCTC 
AAATACrCremiCrATAAACACAQQGOVQGAACr^^ 
[A,-]. 

AAAQGACAATTAT^TTCTATTAATAI lb I IblGAATATTTTCAGrCCrCACATTCTCrAA 
AAATCTTrCTAAATGGLI I ICil IATny\ATmTCrCATTITATATCinOTGC(:AACAG(:A 
TTTTCATCCrrTCTCTTCATAAl 1 1 LI I I lACAAACAGCllQCTCAAGAQGAAQGCTCAAA 
GTCrCMGGCTGAGCAOGTAATCA LI 1 1 I Cil l A tnACTAGATTyVGAAGGGCnTOCTCAG 
GAAATCAAAAGCTAAAACATCAAAAGAAGATAAACAGQ^^ (SEQ ID NO: 74) 

0^GAAATGCAAATTTX5AACCTrATTXn"CTGQQGCAAT(^GTT^ ' 
TTTTATACraTAAICjl M ICil I I CATCQGATAGAGOVCTAATCTCTQCAGCCCAQGreC 
TOOWVTACTCTUrTCCrATAMCACAQQGC^QGAAC^^ 
AG^GAAAAQGACAATTATATTGTATtAATAI lb I IGKaAATATTTTCAGTCCrCACATnS 
TCTAAAAATCnTCTAAATQQLI I IGI lATrcAATTTATCrCATTTTATATCTCTTQCCAA 
[C,T] 

ACKATTTTCATXiaTTCrCTTCATAAl I ICI 1 1 lACAAAOVGCPQCnCAAGAQGAAQGCT 
CAAAGrCrCAAGGCrGAGCACGTAATCACI I I IGI lAGTACTAGATGAGAAGGGCTTTCC 
TGAQGAAATXSAAMCCTAAAACATXWW^GAAGATAAACAGAATTTGGAG^GT^ 
AGAQCATATAATATTL I GC 1 1 CTAAAGTAATATTCrTCTAQGAAAGTCAQQGGC^ ^ 
TO^CrerrAGGOCAGAAATCArATTOCTAT A I 1 1 I C I I IG ATAGCTTTAGGAATAATGCA . (SBQ> ID NO: 75) 

21840 . TGAACCrrATTUrCTXmSCAATGAGTTTTGAGrATTTAAGTCAGAC^^ , 
... IGII I IGII ICATQQGATAGAGCAGTAATCTCTXSCAaiCCAQGTXXTCrCAAATACrcre 

. TTQCTATAAACACAQOGCAQGAAaGATTTT^^^ 

TTATAtTXnATTMTATTGTTURyVATAT^^ 

. . TAAATQQCI I IGI lATTGAATTTATODVTTmTATCnnXXIC^^ 
. [-,T] 

TTCrarCATAAl l ICI l l lACAAACAGCTXKTCAAGAQGAAGGCrCAAAGrCrOVAGQC 
. . TGAGCACDGTAATCACI I I IGI lAGTACrAGATCAGAAGGGCTlTanGAGGAAATGAAAA 
(XtAAAACi^TCAAAAGAAGATAMCAGAAT^^ 
. TCTXKTTCrAAAGrAATATTCrTCTAQGAAAGTCAQGGCGTTTCC 

GAAATCATATTCCTATAI 1 1 ICI I IGATAGCTlTAQGAATAATGCAAATTarAAGCCCAA (SEQ IP NO: 76) 

21841 . GAACXT TATTXJraXSGGGCAATCAgTTTCACrATTTAAGrC^ 

Gl M IGI I ICAT^SGATAGAGCAGTAATCrCltKAGCCCACXrrGCTCrCAAATACT 

TCCTATAAA(^CAGGGCAQGAACreATTTTmTWAACGrA^ 
TATATTCTATTAATAI IGI IGlGAATATTTTOVCTCCTCACATTCTCTAAAAATCrrTCT: 

AAATQGCI I IGI lATTCMTTTATCTCATTTTATATCTCnR^ 

[-.C,T] 

. TCrClTOVrAAl I ICI I I lACAAACAGCrXOCAAGAGGAAGGCTCAAAGrCrOVAQGCr 
. , . GAGCACXHAATGACI I I IGI lAGTACTAGATGAGAAGGGCrrTCCreAQGAAATGAAAAC 
. . CTAAAACATGAAAAGAAGATAAACAGAATTTXiGACACn^ 
CraCrrCTAAAGTAATATTOTCrAQGAAAGTCAQQGCm^ 



21710 



21826 

/ ^ . . . . 
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AAATCATATTCCTATAI I I ICI I IGATAGCTTTAQGMTAATGOWVrrCTAAGCCCAAG (SEQ ID NO: 77) 



21843 



22045 



22061 



22348 



22682 



ACCTTATTCTOQQQGCAATOVmTCACrAmAAGrCA^ 

TTTCrnOVTOQGATAGAQCAGTAATCTXmy^ 

-CrATAAACACAQQGCAQGMCreATTTTTTATW 

TATTCTATTAATATTl^rnjreMTATT^ 

ATTSGCTTTGTTATTCAATmTCTXIATTm^ 

[-.c] 

TCrrWAATTTCrriTACAAAOVGCTXjCrCAAGAC^^ 

GCAOCTAATGACI 1 1 Ibl lAGTACTAGATCAGAAQGGCrrTCaTGAQGAAATGAAAACCr 
AAAA(^TCAAAAGAAGATAAACAGMTTTQGAO«JlT^^ 
QCnCTAAAGTAATATTXTrOTVQGAAAGTXSAQQGGGT^ 
ATCATATTCCTATATtTTCrnWAQCriTAQGA^^ 



ATATTTrCAGrcCrCACATTTGTCTAAAAATCrTTCrAAATGGLI I lb I lATnGAATTTAT 

CrCATTTTATATCrcnKICAAOVQO^TTTTC^^ I ICI M I ACAA 

ACAGCTX^CTXy^AGAGGAAGGCTCAAAGTC^^ 1 1 Ibl lAG 

TACTAGATlWWVQGGCrrTCCrcAGGAAATCAAMCCrA^ 

CAGMTTTCGAGAGrXSAGATATAGAGCATATAATATTCTGCrrCT^ 

[C.A.T] . 

AQGAAACTGAQQQGGTTTCCaXiGCTUrrAQQCOVGAAATCATATrCCr^^^ 1 1 ICI! I 

GATAQCTT^AQGAAtAATC0WVT7OAAGClGG^AQGT^^ 

AGCTTAGCTCCCATXSACAAAATACCATAGGCreGATGCATTAAAC^^ 

TrrCAO«3CTCTX5QGAGCTCQGAAGrrrAAGATGAGA^ 

TGAQQQCt01CTTTCreGCrnKy«^^ 

CATTCTmAAMTCTTTCrAAATCG CI I I GI l A TTt^mATCTCATTTrATATCnST 
GCCAACAGG^TTTtWCCTTrcrCTTOVTAAl I ICII I lACAAAO^GCrGCTCAAGAQG 
AAQQCrOWVOTCTtAAQQCTCAGCAiClOTAAT^ 
CrntlCrSAGGAMTCAAAACCrAAMCAl^^ 

AGATATAGAGCATAtAATA m i GLI I C fAAAtnAAtATTCTTCTAGGAAAGna^GGGClG 
[G,T] 

TTCCCraGCrcrrAGGCCAGAAATCATATTCCTATAI 1 1 ICI 1 1 (aATAGCTTTAGGAATA 

ATCCAAATTCTAAXCCAAGCTTCAGMTAGACTAAGAAOT^^ 

CAAAATACCATAQQCreGATCCAtTAMCMTOGAAATT^^ 

GCTX3QGAAGmAAGATCAGAGTCCCAGCATC(jrrQQGrrCTAG 

GGCrTCCAGATAGACCCCrrCTCACTXJrATTXjrCATATC^ 

GAAAGTCAGQQGmTCCCraKnTGTTAGGC 
TAGCrrTAQGAATAATGCAAATTOTVAQCCCAAGC^^ 
CTTAGCTTKiCATGACAAAATACCATAQQCTTQGATO^TTAAAt^^ 
TC^GACSGTCTQQGAGCreaSAAGTTTAAGATCA^^ 

AQQGCrcrci l ICHiGCmSOiGATAGACCrCI ICICACrcrATrcrcATATQQCAGAGA 
C-.A.G] • ^ 

AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGGATCTTTCTCI IGCI I ICTATTATAAGG 
COOAGTCCrcnXXSATCAQQGrrCCATrCTTATGAC^ 
GATQCTATCTCO^TATAATCACAClQCnTQQGTTAQCXKICTC^ 
QGAO«y\GCro«nX3CATAQOWVQ^^^ 

CACAAI 1 1 1 lAATATAAATATTTTATOGTAACI II II II II II 1 1 KjAGATGGACTXTAG 



ATCTTTCrcl ICC 1 1 I CrATTATAAQQCCATAGTCCI d I ICGATCAQGGTTCOXTTCTTA 

TGACrrTATTTGACTmCCCCCCTAAGAT«TATCTCO\GATATAATCA 

TAQQQCCTXiAACATrTCGArmSQGAGQGACAC^^ 

AGAGQGTTQGATATrTAAAACTAGCrACACAAl 1 1 1 1 AATATAAATATnTATOGTAAGT 
I II I I 1 1 1 II 1 1 ICAGATOGACTTCTAGCraxniXKICCAGGCTCKJAGCGCM 
[A,G,T] 

CTCAGCrOOQCAAGCTCGGCXnCOZAQCrTOV^^ 

AGTACnrOGGACTATAGGOVClGGGCO^ClOUDG^ 1 1 1 1 1 1 1 1 lAI 1 1 1 lACTA 



(SBQ ID NO: 78) 



(SEQ ID NO: 79) 



(SEQ ID NO: 80) 



(SEQ ID NO: 81) 



r 
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GAGAGQQGTTTTKACCATATT^JTCAQGCrrOTCTCXWKCT^ 

CCCATCrnOKCTCCCAAAGTXKTOiGATTAC^ 

. . GCrrrAOnGAGATCTMTTCACATCCCAT^ (SEQ ID NO: 82) 

22783 ATATMTCACACQGreQGTTAQGGCCr(^CATTT^ 

CO^TAGCAAAGGATMTCCAGAGGG^^ 1 1 1 lAATA ~- * 

TAAATATTTTATQCTAALI 1 1 1 1 1 1 1 1 1 1 1 1 IGAGATOGAGTCTAGCrLltil llaCCCAQG 

Cn3GAGC3QCAATCCTO:GATCrOVGCTCACreCAAC 

Tcrccrcxiau^crccTGf^ 

; . [-.T] 

. 1 1 1 1 1 1 1 1 1 ATTTrrACrAGAGAOiQGrmSCACCATATTCGTCAQQ^^ 

COGACATCAQGR^TCCACXCATCri^ 

CCACOXICXICrAQCCAGCAGCrrTAaXSAGATlCnAATTC^ 

CTAAACTATAC^TTCAGTCACTTAAAACArTTATTTAl I I I lAAATTCACAGAATTACA ' 

TGTATTTATCATTnV\lCAACATGATimT7X3^^ (SEQ ID NO: 83) 

23448 . TTCrCTTACnATTlT^ 

TAGraGACKTGTCAACTTATTCCrCATCrCAAGCTXSAAATTGTT^ 

ACCATACCCGACrCCCAAAGrATTCTCCrCJaOTTCTAtGAGATTAAt.1 I I I ICTGAT 

. TCCAOVTGAGreAGATCATXXyVGTATTTAI I IGICI I lACCrOGCTTATTTCATTCATAT 

TCTTACAGATAACAQGATTTCLI ICI 1 1 1 1 1 IAATQGCC3GAATAGrrTTCrATTCTATAT 

... [A,G]. 

TATAGCACATTTTCTCTCTTC^TCCATTCGTlQG^ 

. . TATCGTGAATACTGCTATAATGAACATQQGAATXKAG^TGGCTC^^ 

r. CATTTTATATATCnxnATATATATAlT^^^ 

AQGATWATCCnVSGnCTATATTTAATTTTTAAA^ I ICCATAAT 

GQCRnAtrAGTTTAACrcOTCACOyWIAQQGn^^ (SEQ ID NO: 84) 

24960 II IGI ICrAGAGrATAGTTTAAGtCTCAIGI I ILI lACTGAI 1 1 ILIGI IGAGATGATTT, ' 
. grCTAI IGCIGAAGGTAGGb lGI l(jAAGrraX CTAiCrATTCCnnATT^^ 

tCCnTCAGACXJrATTAATQIjI I I I lATTTTATTTTAI I lU IGI ICjI Ital IGI IGI ICal 

TCrrcrnTTGAGAGQGAGTCTC^CTaOT^^ 

QQOrOKCrQCAOOOOOjSTCrOi^^ 

......... [G,A] 

aX3GGACTACAGG0GCATG0CAa^0GCCC>GCrA A I 1 1 1 l blA TTTTTAGTAAAGAC3SG 

GGrnO^CCATGlTGGCOV«y\TGGTTnTCATO"Crny^CrT 

.. GCCTCCCAAAGTXXnXjQGATTACAQGrcreAGCCACCACCCCrGGCCAAIGI I IGGTATT 

TATCT!TAGG7T5CrCTlGATGnXK5GrrCATATAT^^ 

CTtATTAAQQGATATXKMTATAAAATATATAAATTC^ (SEQ ID NO: 85) 

24983 . . TCTGAIGI I ICI lACTSAI I I ICIGI ICaAGATGAI I IbiCTATnGCreAAGGrAGQGTXJT y 

TXy\ AGTCGCCrACT ATTXKTCrATTGCACTCTCrcrCrCCrrTCA^ 

: . TTTATTTTATTTTAI I IGI IGI IGI IGI Ibl IGllbl ICil Ibl 11 I KjAGAOOGAGTCTC 

ACTCTCTCACCAQGCT^SAGreCAGraGOVQQGrCrClQ^ 

. . CQGrnCAAGGGAlTOX:CtT3CCTCAGCrrcCC!^^ 
[T.C] 

O^GGGCCAGCTAAI I I I ICjIAI I I I lAGrAAAGACGGGGTTTCACCAIGI KjGCCAGGAT 

GCTcnwomiyvcrrcATWG^cccxsCcnxKsccrcccAAAi^^ 

AGGTXJTGAQCCACCACCCCTQGCCAAIbl I IGGTATTTATCITrAGCJreCTCnGATCrrTG 
GGrrCATATATATrnTAAAAAAGAATAGCTAO^TAACmTTAAGQ^ 
AAATATATAAATTXnCACACreAAAATTTAAAATQQGAQGAGra^^ (SEQ ID NO: 86) 



25390 



AGTCCTCQGATTAOVQGTTnGAGCCACX>VCCCXnT3^^ 

GTCCrCTCAICI IGQGTTCATATATATmTAAAAAACAATAGCTACATAAGmTT^ 
GGATATCCVOATAAAATATATAAATKnGACACTXyW^T^ 
GTAAAAGTACCrrCATATAACmCTATTATATCCTCTrATTGAAT^ 
TTATATAQGAALI I IGI I ICTCCnTAONALI ICItACTTAAAbI I IGI I I lATATGATA 
[T.CJ 
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AAGTAAAGTTACrCCTCCrCTCCI IIGbl IICIGI I ICCATQGAATATCI I I I ICCATTC 

arOVCC^TCAGTCTCTXJrGTAI I I I IA(^GATCAMTGAG^Crc^G^TQGGCAGCATAT 

AGTTCGATCTAtil 1 1 1 1 1 lAATCCACTCAGACALIGICil 1 1 1 1 IbATTQGATAATTTAAT 

C(^TTCATglT OVAGGT AAmTTWAAGTAAGGALI I ICjIAGTACCAI 1 1 IGLI lATT 

^ , ..Girro^TOGrrm (sbq id no: 87) 

26060 . . Gb I I 1 1 I Wj I 1 1 b I (ib I I ACCAAGAQCJrrACAAAAMCATCrrAAGAGrrATAATA^ 
ATTTTAACTTGATAACnAATTTTTATTXKAAAMCCCC^ 

TmACrrAATOCOCTXSAAATTrPGA A l 1 1 1 Ib ATSTC^CAGTTTAClClXTrTTCATATr 

GKnATCCCrrAMTTATTUTAGCTATTATTAaTrr^^ 

AGAlCTAAGTCyVTTTTSCATACCATCATrACAGrATrATTrrTy^^ 

[C,T] 

TmATOtfXICAGrrTTATACrrTCAGAICil 1 1 1 IGlbl I ACrCATTAGCATCI M I ICT 

TTCAGCrrcAQGAGCrccrnTA(Jbl I lATAAAATAQGnmnUVTCATTATCrcC 

CrCAGCTAI Ibl I lblCI(3QGA(\AGTATCTCrCCTTCAI I ICIGAAQGACALI I IGLKjG 

. . . 1 GTACATTAGCCI l(Jbl ICjOTAI 1 1 1 ICrCCrreAACXXTrTAAATATATCATCCCTTTCr 

* CTOTCACCTGrrAQGTIOCTGaiSAGO^^ (SBQ ID NO: 88) 

30245 ATTTTAACCATCCAI ICill ILIGLI ICTCTAGATAACCCrGACrAATATATAATKSGrAT 
. , GAAGTC ATATCrCATGGCTTTGATTTATAI IILII ICATOGCTAGTCALI I I II I lOTAC 

I I I IbQGATAI IGI lATTATTATTATTATTATTACTAblGI I lATACI ILI ICAGTAA^ 

GKnTAGAAACAATTTTTAAAGGONGAATCTGACC^^ 
TCATQGACCTraxrCAAGTOGrAAQCCATr/^^ 

[c,G] • 

TTCTTTTCrrCO^tTTCAaxrrCTCTTTCnOT . , .. ' 

CTX5AATQGnXntAATAT«3TTTQGATATTT^ 

ACCTCCAGrcnTQGAAGTAQQGACrACTTQQGrCftaS^^ 
TTGCTAATAAGreMCTCTATTAGTTCATTW^ 

TCATTTCTcrnCTCcrrcrCTCACCATCTCACACAcrrGcr^^ i ii i ici icagcca (seq id NO: 89). 

33664 TTOCACyVSTOTAGAACTACAaOTCCT^^ 

CAGACAGTATATGAMO^QQGAAATTAGAQCKX:^^ 

TTTAAAGAAAATATTAGCAMOTyNATCAGCCATrrTAAAAAATATACCA > 

: ATTCATAAGAQCACKTrAACAAAATTTGTTAGAAQGOmAAAGA^ 

AAGATCrACC^rOOa>^AAtTOGTWAGAGAT^^ 

[G.T] 

GrrTTTTTGAQGAA(XrurG^AGaGAGrCTCAAATTTATATOWVGA« 

GMTATCCAGGACAmaXWVGAAOOTAAGGAGCCAGGGGCaOZOT r 

: AAQQGlTGrrATrAAGCCATAACCAAOTCAGreCTX]^ 

CAA6TXyW\C>TAAtAGAGAQCCOVGAAACAGACX]^^^ 

GAAAGAACnAGCrrnQOW\ACTTra^^ (SBQ ID NO: 90) 

33883 . TAMGAAGACTCAGTATAGAAAAGATXnACCrrcrCrCOVMTTOJ^ 

TGCCATTAAAAAAAC<XACLIU3l Mill l(j<\QGMCrrGrOWKnTyVGTCrOV\ATn: 

ATATCAAAGAGCAAAQGCCTAAGMTATCCAQGACATTCCT^^ 

GGGGCCraCCCTATOVGAtACCAAGGbi IGI lATTAAGCCATAACCAAGrCAGrGCrGTT 

: TCTACAGAM(:AGA(^GrrAACAAGTCAAACATAATAGAGAGCCC/VGAAAC^ 

[C,A] 

. CATATTTTXKSATTrcrOVGffreAAAGAACT^^ 

GTtnXXWAGArcATQCTOCTTKTCAnxyVGA 

TTACGCTACACAAACACCAACCTAMCGTOWVnTAMCTATAAO^^ 

. GQGAAGAAATATCTTTATCTCAGrcTAGGGAAGMTmTmAAAM^ 
GQCCATACAT/VQGMTtyW^AGAfrawrr^^ (SBQ ID NO: 91) 

34373 TATmTATCTCAGTTJTAQGGAAGAATmTTTTAAAMGAAGACACAAAAGG^ 
. : TAGGAATCAAAAGArreMTTCAGOlKATrAAAAAGATTAAATTCA^ 
OVAGAQCATQTnACrTXjGAOVQCATAGAGraGAAAG^^ 
TTAtAACrrGAAQGATTAGAATXyWTWATAAAlGAACrAT^^ 
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A(^CaxrrTAGAAAMC3QQGOW^GA(^TTW\C^QCATATTTC:AC]G^^ 
[G.A] 

GTAGCAAATOVVCATOnyWSWGATGCTXIA^ , 
GITATACCC^CAGO^AGACrATCrrATCrAGGAA ^ I IfalC MTACXXTAAAIGI ICIGI 
. GG mTAAG CTACAGAfa I j I (a I AATrp^TrTATTTATTCAATAAATACTCAGrGGCAGGC 
... A LIb l 1 1 lA GAAAC L I lA TMCTliPGAATCAMTTAAAAAAAATCC^ 

GfiGSfij'CcnATGrGToaasfiGn^ . (sbq id no:92) 

34558 . ACTTCMQGATTAGMTOVATWATAAAGAACr^^ 

a3GTTAGAAAAAC]QQGCAAAGACATT^CAGO\TATTTCAC]GTX3^^ 

, CAAATCAAt^TTSCnVVAGAGATXaCrCAACAOGrrTACT 

TACCCAO«K>VAGACTATCTrATCTAGGAAGrnXjrCAA 

TTAAQCrACAGAGTTTUrAATTCATmTTT^^ 

[G.T] 

. TTTAGAAAC C I I Gb I I A TAACTTTGAATGAAATrAAAAAAAATCCTTCCL I I G I (aGAGGA 

TCCmTCnxnim3AGnX3QGTOGTCQQGTCAAA(^ 

AGTCAC^TAAATAAACCTATAAATATTTK^WXCAGAGmT^^ 
GACTAQGAOCTCATOT^GATATACCTCTXJTTQCTGQGACAM^ 

TTCCCATA^XK:AAGT0^^AATAAAAAffTX3ACACTAGAAAA<:A0^^ (SEQ ID NO: 93) 

43929 . QGCATTTAAGrATTOTKICATAQQGAAGTXJrAAAAGrr^^ 1 1 1 lATAGG 

. TACTATATTXmrCAAATAATCTDVQO^CCTCATCCnT^ 

GGTCAGATTA I G I I I ATCTCTQGCATAAQGCACTTAACMTATrCAWAAAGGmW 

. ATLI 1 1 1 ICATCrGCTTAGCATTTCATACCAGI I Idl I I I COVCCAAACTTTCAAA 

TTTTTGATTtnTTO\TTAATATrCreOVTAGTC^^ 

[T.A] • . 

CreCrcGTiy\AACCXTrAfiGAACTCTtreAAQGAGT^ I 1 1 1 IGII I I IGI 1 1 

TTCrrTTTCTTTTXJrriTTTTGAGAOGGAOT ' 

: :< . . ... TQGTCCGATCTCQCKTCraXK>\AACTCQGCCTCCQQQGrrCAC(K:CATTC^^ 

..... AQCCACaKSACTAGCTmSACrACAQGCACCCACCACTXKjGGaXS^ 1 11 1 II lU 
Al 1 1 1 lAGTAGAGAOjQQCjrTrcACailGI IAQCC7^GGATOn'CT(WCT(XTX3ACCn: , (SBQ ID N0:94> 

44309 . . TreAGACGGAGrCnXKTaUrreCC(^QGCTAGAGTXKA(^^ 

TXK^CrCiQQCCrca3QQGn"CACjQCCATTCTCCre(^ 

ACTACAQGCAOZCACDNaXjaj^^ 

TTTCACCGTXjrrAGCOVGGATQCTCTCGATCrcaGACCTT^^ 

TCCCAAAGreCRKiGATTAG^QGCGTCAGCCACrGnKICCDGGCC I I I I I I I II II I I I I - . 

. [T.-,c] 

: . . . tTTATOSQCTTOTCTTCrACACITOVj^^ 

CAQGAGTKACATraCC^CrACTTVACAATGCCTAAQ^ 

..... TTQGTGATTAGTXKiaTCTCAQCTATGAGrATAAGATAATAT^^ 

: GCOTVGATAAATTXnACAaATCKWNGrrmT^^ I I I I lA 

. ; AQCnVVGlTGATAAO^jrreAGACm (SBQ ID NO: 95) 

44997 GAATnnAAAAATAmTTATAGMTTTmTCT • 

TGAAQQGGTGATCATTTCAAACAATACCfCTCOVTTAGC^^ 

. TCCAICI 1 1 l AAATCATAAGTOVGATTTATAAAAATAI \ I I I ATAAACACTAQGAAATGA 

. GnrtTAGGQGrATTCACATACAGTTTTAAl I I 1 lATTTAC^TATTTAAAACATATCATCGT 

. ATAAATATCATCTCGATATAAATrrcAGATAAAQG^ 
.. [T.G] • 

AAI I ILI lAAAAGATCTCATCACCAGI Ibbl 1 1 I CTAGCCTTATGAAAAATCGnGCAAT 

AAAAAAGATTGACTATWAAAATCGl«:<XrrTG^TTTTA^ 

. ATACTinXSAAincrATCATCAAlXSA^^ 

TTALIGI I lATTTTCAr I l(_l ICIGAACFGATACrxnALI I ICI ICATTUTGAGTAGACA 

ACTTATAATCrATCTACrONAA 1 1 I M AGTrATAAATTCTAGGGAATGAAGmCATATT (SEQ ID NO: 96) 

46538 TGTTATAaTATGGTlCAACACI 1 1 i lATAI I KjILIGIAGAI I ICIGIAOWVAAGATTC 

. . TxyvoomTTAAGCO^CATTOcrrovy^^ 
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GGOW^GCTMTGCTTTAAAGAAAAAGGAGA 

[A.G] 

GGGAI IUjIGICjIGI 1 1 1 Id I lAGGAACAGTAGrMCTITSACriTTAGiAGAACrrGAAT 

AAQCAmATTTTTT<XTTTCraTATTmTT^^ 
. . . . . . GGAnriTCTCTCGMTTTAGriTaTK^WVTn^ 

TTWAOTOOVVGAAAGACTCAC^^ - " 

TnATTAOSaaSMAfiG^VXXa^^ (SEQ ID NO: 97) 

48153 . TTATATCATTLIbLI 1 1 lAi 1 1 1 IAQCnTCA0QGTTCAAAATC/«y\OWNATXyVA<^TAT 

TTGGTCGCrrraSACAGATCCnAAAAGAAQGAQGTATCCQCTCGCrr^ 

CTACAAACDCTCATCAAMTKCTCCTCAGACAGCT^ 

TAATTGTTATCACCCGraGAATTTATTAAO^AAGAQGAGrr^ 

TCTTAATCTATAATOLI 1 1 ItaQGATTLI IGI 1 1 lAATACATWAATCnTCACATATAC 
[T,C] 

CCATAAQGAQGATCACrrATAQGAGATTAGACTAMTAAAATOW^GATTrcrC^^ 

AAGmTQQGATTCrrAATTCATCArATTATTTAT^^ 

TTAAAQGAAGQCnAGMTTTTACnTTAmATTO^^ 

MO^TAAGTTTTATCAAAGTXJTDVCMTCrAACCTCTOW^ 

TCCTTTGrrcrAATTTGACGTlTGCTCTAAAATTCA« (SEQ ID NO: 98). 

48288 . AAATTGCrCCnyVGACAGCnmTWVTTC^^ 

.. GTQGAATmTTAACAAAGAQGACTTTWnAAACJQGAT^^ 

CI 1 1 ICiQGATTLI IGI I I lAATACATCATAATXTirCACATATAGGCCATAAQGAGGATC 

AGmTAQGAGATTAGACrAAATAAAATOVGAGATTrCTCATGACCAAGT^^ 
r *. TTAATTXATCATATTATTTATAAAbI 1 1 1 1 1 1 1 I ICTAAGTAGI ICI lAAAGGAAQOCTA 
.... [G,T] 

AATmACTTTATTCATTCTGAATCXnGAGCAGAAGO^ 

..... AAiVGTOTCACAATCTAACCrCreGAAQGAAAACTAT^^ I I I G I G I AATTT 

GACGnX5CTXJrAAAATrcAGCrcAGrrTQGAGTG^CACCrC<^ 

CrrCrrCCCCATGTACrcCAGCACCrAC^O^GAQGTTC^ 

... GrreTTTWVTCASTCAATCAATGAAC^ (SEQ ID N0:99) 

48412 .. TGQGATTCTTXnTrrAATACATCATAATCrrTCACATAW 

ATAQGAGATTAGACTAAATAAAATCAGAGATTTCTCATCACC^^ 

TTCATCATAlTATTTATAAA G i I M 1 1 1 1 1 I CIA AGTA G I I L I l A AAGGAAGGGTAGAAT , 

TTTAGrrTAmATTCTGAATCaXSAGCAGAAGCAGC^ 

(HTCTCAC^TCTAACCTaXKAAGGAAAACrATAAGrnGAACTCL I I I G I GTAATTTGAC 

[G.A] 

TTCOGTAAAATTCAGCreAGTTnSGAGTXS^ 

, , . . \ TTXXiGCATXnACTCGAQOVC^ 

TGAATOVGtCAATGAATGAACAAATQCATTTACCTCTXWKTCA 

GTTAACrTOSATTATTTGAGCTAI IGLI ICAGCCTAACTCAATCTAAAGGGGAAATACAG 

. AQGrAAmTTAGAGmGQGnTCrcrmTCGTX^^ (SEQ ID NO:lflO). 

48446 CATATACCGCATAAGGAQGATCACXrATAQGAGATTAGACT^ 

TCATCACCAAGTTATGGGATTdTAATTCATCATAmTTTATAA^ I II 1 1 1 1 IICTA 

AGTAGI ICI lAAAGGAAQGGTAGAATTITAGTTTATrCATTCTCAATCaTGAGCAGAA^ / 

. AGCACACTAA CATAAGrr r TATXyW VGrGrCACAATaAACCTCrG ^ 
. AGTPGAACTCLI I IGIGIAATTTGA(JGI IGCIGIAAAATTCAGCTX^I i IGGAGTGACA 
[C,G] 

. . CrCCATGAAGQCAGGGGCGTCGC I I C I I CCCOVTCTACTCCAGCACCrAGAO^yVGCrre 
. GGOGTWAAGriTCAAGGGAGTUrrGAATGAGTCAAT^ 

TCT^TCACrrCrCTCTOGG CI M I G I l A ACrTOGAmTTTCAgcr AnGCr i Q ^GCC ' 

. TAACrONATCrAAAGGGGAAATACAGAGGTAAGrrTTrAGAGTTT^^ 
' CATTAGCAGAACrcrCTAGrnGAGCAGCCACAGATrAIGI 1 1 I CCATTATTTATTCCATC (SEQ ID NO:aDl) 

48456 . ATAAQGAQGATCACTrATAQGAGATTAQ\CTAAATAAAATOV^^ 

GrrATQGGAI ILI lAATTWCATATmTTTATAAAGI 11 1 1 1 1 1 1 ICTAAGTAGI ICI I 
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AMGGAAGQGTAGMTTTTAGTTTATTCATTCTCMTCCTCAGC^^ 
CATA/«jrrTTATX]AMGTOTCACAATCrMCCrCTSGAAGG^^ 
CI I IblblAATTTGACXal l(3CIGIAAAATTiy\GCTCAGTTTCGAGTX3ACACCrcCATGAA 
[G,C] V . 

GCAGOfSGCGTrOGCI I CI ICCCCATGTACrCCAGCACCTAGACAGAGCmXXATXnWA 
AGTlTOVAQGQ^rcmyi^^ 

TTCrCTGTCiQGCI I I IGI lAACTTCGATTATTPSAGCTAI ICiCI ICAGCCTAACTCAATG 
TAAAQQQGAAATACAGAQCnAAGnTTAGAGrmjQGTTCrcm^ 
AaCT CTA GrPGAGOVGCCACAGATT A I fal 1 1 ICe ^TTATnATTOCATC A l I bH lA TC 

48789 . . GCACCTAGACAGAQCrnQGCATCTWAAGrrrCAAGCXyVLillal KaAATGAGTCAATGAA 
TTW^CAAATXXATTTACaxnX^TQ^CTTCTCrgrCGG C I 1 1 l b I l A AaTGGATTATT 
TGAGCTAI l(iCI lO^GCCrAACTO^TGTAAAQQGGAAATACAGAQOTAAGnTTAGAGT 
TTl3GGrrcrCTTTATQCTCATTAGCAGAACTOTCr/«JrTGAQ^ 
TC(^mTTTATtaATCATTUnTATOVAQGACn^^ 

[C,-] ■ 

CCOCCAtAGTnrrGrATTATTOIATCrAGAm 
rCrreAQCAACAGAATACTCTTCAGAAGATTAOGAAGTCC^ I I ICI IIGCC 

. . TAGGAAATAGAGAAGCAAAAAAAAAAAAAAAAAAAAATTAAAGAAAATCrACTCrCCAGG 
ATTTTAATTAGAACCTATCOTOGGAAGGCTATTTrCCXrATATGAAQGrrT 
AAATCATGATTATTAAQQGCTAAIGI I IGAGATACCaTAQGmTTCTGACCACATACr 

48859 . CATTrAGCrCTCAATCACTTCrCTXnCGGCI I I IGI lAACnGGATTATTTGAGCTATTG' 
CrrCAGCCTAACTCAATUTAAAGGGGA^ATACAGAGGTAAGI I I lAGAGTTnOQGTTCTC 
TmTGGrCATTAGCAGAACrcTCTAGrnGAGO^GCC^CAGATTATGm 
ATTOCA TCAI IGI I lATCAAGGAGrXnAAGGGCXTTGAMTrOVACTCCCOCCCCiCATAG 
TTTTTinATTATTCX^TXnAGATrrrAGATTAT^ 
[G.C] 

AGAATACTCrrcAGAAGATTACGAAGrCCAGTQGrATCCI I I ICI I I GCCTAQGAAATAG 
AGAAGCAAAAAAAAAAAAAAAAAAAAATTAAAGAAAATCTAGTCrCCAGGATTTTA^^ 
GAA(XrATCaTCGGA'VQGCTATTrrccrrATAl^^ 
TATrAAQGGCTAATCrrTCAGATACCCTTAfiGTTATTC^ 
GATAGGAAAGCCACAQCCrAAMTAMTAAATACTXIAATCC^^ 



(SEQ ID N0:1D2) 



.(SEQ ID NO: 103) 



(SEQ ID*N0:1D4) 



49126 GATTATTCTOGft GAiLiKj l 1 1 IGI ICI IbAGOVOGAATACTXTTCAGAAGATrAOGAAG 
tCeAGroGTATC C I I I I C I I IG CCTAGGAAATAGAGAAGCAAAAAAAA A AAAAAAAAAAA 
ATTAAAGAAAATCTAGTCTCCAGGATTTTMTTAGAACGTATCCrrQGGMGGCr^^ 
CCXTATATGAAGGrrrXSAAGATrCAAATOUGATTATTAAGGGCTA^ I IGAGATACG 
CrrAQGmTrCTGACOMZATAOTGGATrTrAT^^ 
[A,G] 

TAAATACTCAATQCAGTTATTTCAGTATGCAAGAAGTTTGGTAI I I I I GAAAAAGTCCAT 
GGGTATTGCAAGCAAATATGCACATTTTlQCrrrATGCCATTTGrCAGATTC™ 
ATACCACCAACAGGCATCCrC I GC I I OUTCO^CCGAAGCTCCXrCCrGAGACCTCnTA 
TAGTATIG TCATTTC rGCACACTAACI I ICI lAGACATGAAGAGAAAGCrerCTACACAG 
TGnOGTCTAGI M ICI lATOGGCTOOGACCTATOGIGCIGI 1 1 ICICICCrOCTCCreA 

49378 . TCACCAO\TACTTCGATTTTATWAQGAAAGCCACAGCCrAAAATAAATAM^^ 
TCO\GmTTTCAGrATQCMGAAGTTTGGrATTTTTGAAAAAGrCC^ 
GCAAATAtGOW^TTTTTXTrTATCClCATT^ 

AQQCATOJcrocrrcnnxicAccoNAGCTaTrocr^^ 

TTTOGCACACrAACrnaTAGACATCAAGAGAAAGC^ 
[T.G] 

TTCTTATQQGCrCTXSGACCTATQGIGCIGI 1 1 ICTCTCCTCCTGCrcAAQGTXXATKAr 
CCCTGQQGGCTCTCTAAAAGCCACCTTCCTCTCACAAGCATATACr^ 
AAGCCAGrrCCrCCCCTUrCCAGCCrCCCTOSAGTGCTCAATTXKiAGAATATCCCA^^ 
TCATTCGATGAR5GAAAACCCAI IGI I 1 1 CCCAGTGGATTCTAAATTACTTCGGGGTAAA 
TAQOaxnATATATTCTCAAATTTCCCAGAGrATCTAACTAQ^ 



(SEQ ID NO: 105) 



(SEQ ID N0:1D6) 
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49482 TCCATtiGGTATTtSCAAGCAAATATGCACAl 1 I IGLI I lATQCCAl I HjICAGATTCTTAC 
. , CrrGGATACCACCAACAQGCATCCTCIGLI I CTUTCCACCCAAGCTCCTTCCrcAGACCr 

CnTATAGTATTCTCAl I ILIliCAOOAALI I ICI lAGACAreAAGAGAAAQCrerCTA 

CAeAgre ro GTCT A i Li i 1 1 I C I i A TSGGcrcroGACCTATS (aiu . iCi i 1 1 i L i cicc rccr 

. .. „_.. QClGfiACGTOJiJ^ 

[A.C] ' - - - - 

(TAAGGVTCrCAAT(:AAAGC<:AGTTCCTCCCCrcrCC^GCCrCCCTC]GA(nXKJ^^ 

OVGAATATCCCATTTTTCATTCGATXSATCGAAAACC^^ 

ATrACrrCXXXSGrAAATAQQCrGTATATATtCr^^ 
CACTTT TAGATT CAGATAGAI II IGI ICCnxyVATAGCrAGrACTTTAGGAAACTAAGAA 

AAAGATaTTTCAACCraGTATCTAGCTXnuro . (SEQ ID NO: 107) 

49741 CTCQGOGCrCrCTAAAAGCCACaTCCrciXSACAAGCAT^^^ 

GCCAGTTCCrCCCaUrCCAGCCTCCCrCGAGrecrcAATTCCAG^ 
ATTQGATGATQGAAAACCCAI ICil 1 1 ICCCAGTXXSATTCTAAATTACTrCEQQGrAAATA 
QGaCTATATATTCrOWVTrrCCO^GAGTATCTAAC^^ 
GATTTTTjnXXTPGAATAQCrAGrACTrTAQGAAAC^^ 
[G,A] 

. TATCTAGCraCTOWVCACATCATO^GTAlXmrrAAACCT^^ 
OVTTACCATAGTAGreTCATTUrATCATTCACAGTCTA^^ 

- TCGrrrcAGcrecoocrerAGreAc i gl ii i ccactcga , (seq id no:108) 



4984Gi.. . ATCTTTT(:MCCTQGTATUrAGCrCn:n"CAAAC^CAT6\TCAGrAT^ . ? :/... 

. . . TTCrCTGreOGTTUrCATTACC^TAGrAGT^^ 

:. [A.G] 

TAGTUnSGGGTAGIbl ICI IGlGbl I IG^GOGCCACrCTUrACTlGALIGLI I IGCACTC 
. (^CAtCrrccrCTTTATOXIAAO^C^ 
. . CrCTGCTTQGkTGACCCAGGA(jrGCCTCC(:ACrCAATAT^ 

. TTOGcrACTccaxnCTCcixy^ccjcreCT 

TCrATATEnWATl3GnnQQQCy\ATGa:CTTr^ (SEQ ID NO: 109) 



50102 CATTACGATAOTAGTXSTCATrGrATOmiSACAGTXnWAG^ 
TQGTTTCAQCraZCACrCTXjrACTGACrGCTTrCCACrc 

AACACfXnAQGTCTACCrcrGTAC I C I b I (j 1 1 1 CAGCATCTC I (aC I I (aC^TGACCOVQGA 
GTCCCrcCCACTCAATATOQCCACCATC^^ 

GAcccrGcrccwK>vvcAoyGAOVGA(^ccrrc 

[G,A] . • 

ATKCCTTTAGrACrrACTCAGGAGrrAGrrCCrCroQGAAGCaTC^ 
1 1 1 ICI lAONQCACTTTCACATTTyVATTaXSAGGTTCTC^ I ICIGAG 

AOlireAGCTnCjCTTAQGC^GTAGCr^^ 

AACC<XrATrAAGrAAATGAAAAGAO\GAACTCACAGACTCGMTrAGAGCT 

CCrCMTCTCMGCCATTAAGATGAAQGGGAGCCGGGCGrraGrGGCrCACGC^ (SEQ ID NO: 110) 

-50109 . . . ATAGTAGTCTCATTCTATCATTGACAGTCT^^ ICI ICICCI I IC 

AQCT«:CACrCTOTACnGACrcClTTCCAC^^ 

TAQGTCTACCnjrcTAC I C I C I C II I CAGCATCTC I CC II CCATGACCCAGGAGTiGCCrC 
CCACrCAATATQGCCACCATQCATQGTCATCI I ICICCrACrCCCrctcrCCTCACCCTC ' 
a"CCAG(^CACAGACAGA(^GCrrcCTXTrr^^^ 
[C.G.T] 

TTAGTACma"CAGGAGTTAGTrCCraX»y\AGCClTCTGrraAGTTTCCI I I ICI I 
A(^GCACnTG^C^TTCAATTCTXyvaJTTCTCKnAaTATCTTGCI I ICIGAGACreTCA 
GCrrCCTTAQQCAGTAGCTACrrcTATTCTTAQC^CTTQCC<^ ' 
ATTAAGTAAATCMAAGACAGAACreACAGACTCt^^ 

CTDWXPVTTAAGATGAAGGGGAfiGGGGGQGrra^^ (SEQ ID NO: 111) 

50747 . CCAGCCnm:AACGTCGGW\ACCCO\TTTCTACAAAAAATATAAAMTTAGTT^ 
TCQQQOTinxnGCXnCTACrCAQGATlKnGAQG^ 
AGAGGrnQOVGTCAGCtXSGGATDVCAGCAT^^ 



FIGURE3PP 



Docket No.: CLOOIIpsqor^ 



0 



Serial No.: To Be Assigned 
Inventors: Gennady MERKULOV et al. 
Tide: ISOLATED HUMAN TRANSPORTER ... 



51272 



52842 



61837 



62018 



NO: 116) 



CriUrcrCA/WW\AAAATAAAT/WVTAAATAAAQQQGAA(^TAAQGA^^ 
[G.A] 

Cilbbl lUVaUTQCCtOVAGACIGGIGGAblGIU I ICGGGAAAGATAATCATGAAAGAG 

Cra5AO\GATAMCA(mK:CAMTGrMTAQGAGrCTTQ^ 

GQGGCTATTGTAGCAt^ 

TCAACXTCAAAAAAGAGTCGAGACATTGrraOGGGAGAGreAQ^ 
AGAATATTTAAATAATTXiiVQGTAAGAMTGATX^ 

TAGACrAGAQQCAQQGAGMTATTTAMTAATTTGAQGrAAGAMT^^ 

AAQGTCATOTCTrrAAQGAATQGAGAAQGGAATGAACTGAGAAATAT^^ > 

TCAACAGAACrCAaGACn^CTXK3ATATXK3AGGTXy\^^ 

ATraAATTTCrAACrTCAGTX3ACTXX>XTTCAAAGA^ 

GT^TGCreAGTTTCAGATCnxnX3QGACATCrAO\GQGAGCT^ 

[G.A] 

TATATOVQCTAGCCATTAAGAGAGAGATCmWAGAGAQljl IGI KaCrGAGTTCAGCC 

ATTCGAAlXm^GGAirACTDXAGAAGAGCrrATAM 

TC<:AAAGQGAGAAGrAAAAGAAGAAACTrXK:AAAQGACAaGAGAAGAAATAG^ 

GATQQGAGAAAATC<:AGAGAGAQQGATQGC^TAQGACTCACTX5G^ 

QQQQGrCAGrACrACnSQGTAGnnGMTATMTAAGAATATCr^ 

TCAQQGTCCTTTTCAQQGCTOVGrrAACn^^^ 
GGCAACnTACrrAAAGraOUT^CTATTACCrCATCrCTA^ 

GTx^CATAGrrrrAO\TAeG^GG<::AG'\GTX5CCTX5A(.i 1 1 1 iGGcrcrcrccreAACTCTT 

CCCI I IGlATATQGTAItii I ICQGOGMTAQGAGCCTCAAGCACmTCCrrTAAATATT 
TATCCTCCATGAGrC:ACrAAAC3GrrTACTTCrcrACI 1 1 IbATAGGrecrGreQQQCTOCA 

[G,A] ■ I 

GGTATAAAAGOTACCTTCAAAGmCrcTTAAAGT^QGAAQGI 1 1 1 lAAQCAAATTAT 
GTITA ATGATTnXSACAATCTGACATCCAGGAAAATTAATAGG^ 
GTITTATXn'AACACrCTUrAGrrCAQGAAACAGAGCCCrTG^ 
GGAGGAATCTCrOOTATrraQGAATCrCATGAAATWAAT^^ I I I lATCATG 

AGCACXIAAAAC^CAGATTTXKTAQGAGAAACTOVr^ 

GAQGAACCTCCATCTXATTlTCCATACrrAACr^ I I Ibl 1 1 1 1 lAACATTTCTAT 
CAATCfAtmAAGATTCCAATTTOTCC^TCrcGrc 

GGrCrACrACTATTGCTXjrCil IGLIbl I lATTCGTCCarCAGI ICICilAAblGI I IGCT 
TCATATATTTAGGAGCTrAATATrAQCTCCATATCAAGTTAT^ I ICI ICCTCCTAAAG 
TGACCCATrnTdXTTAlXnAATCTXrATOTPS^ 

[A.G] ^ 

KTATTTTXrrCTGATCnAAlTATCQC3CA^ 

AATATL I 1 1 I ICCATCCTTTCACnTCAGCrrATSTUPCTCaTAGATCTAAAG^ 
TCATAGATAAQGTATAGrnGATTCrcrA I G I G I I ATTCACTCAGCAATTTATATL I I I I A 
G1TACX»GATTTAATCCATTTA(:ATTTAAAGC^G^^A<J^^ I G 1 1 

GTCATTTQGCrAQCTACLI I I I lATLI I IGlCjCrGTCGCmTCTGI I I I ICCCTTCCTC 

CATATAWAQGAGCrrAATATrAGGTCGATATXSAAGTTATAAl I I LI ICCTGGTAAAGT 
GAC CCATT TATCATTATGTAATGTCQVr C I I I GlC rCI IGIGACA G I I I G I G I GI l A AAA 
TCTATTTTCTOXyaXTTAATTAlXmiACCCCrrrrG^^ 1 1 I lATQG 

AATATLI 1 1 1 ICOVTCXTrTOVCTTTCAQCTrATCTXnxn'CC™ 
TCATAGATAAQGrATAGTTXSATTGRCT^^ 
[A.G] 

Tr AGGGG ATTTAATCC ATTTACA T TTAAAGC ACnTA CIWAGQGAAGGAC nALIGI IG 
TCATTTGGCTAGCr y^Cl 1 1 1 lATGTTGrCCrGrGGLI 1 1 IL IGI 1 1 1 i CCCTTC CTCr 
CTTCCPGGLI ILI ILIGIGI I I IGI IGAI I I 1 1 ll I I 1 1 1 1 IGIAGTGATAIGI ILIGAT 
TCCaTCrCATTTCCLI I IGlGIGCATTCTATAGATGCrAI I I I IGIGGI lACCATTSCA 
ACrACATAAAQCATACTAAASFTATAGC^CTrATTTrAAC^^ 



(SEQ ID NO: 112). 



(SEQ ID N0:113>. 



(SEQ ID NO: 114) 



(SEQ ID NO: 115) 



(SEQ ID 
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65562 GACnyWVTTCAGACAOVTTXAGTCTSATTCTMCCCrCCI GTCreCCAGCTCTGATCCA 

GMCrnXKAreACTWACQGaCATAGATTUrCTATXX^^ 

ACCTAAAAGrCreATCATTTTACATCnm-G^GACAT 

TOOVAAbl IGI IAGTray\ATTTCAAAGCXnTrMTMTCrAQ0CCOW.I I Ibl ICACTC 

TCIG IG IMTAACO^ATACAACMTTQ^^ 

[A,G] ^ - "~ - 

I IGICI IGGI IGlGCCAGCAACALIGbl I I ICGCTTTCTCTTCCIGLI IGI ICAGGTCAT 

TTCCAAQGCCOVQGTCI I IGIGLI 1 1 1 ICCCAAGCTTCCCAGAQLI ICI ICCATACTCCC 

CTTACTTCCreAGATTTAACIGI ICICICI 1 0\GC!QCnxncrAGTAAGAAQGA(QGCAGC 

AGCAGCACTCTGGGGTCGTCGAAAGTTnACCAGC^^ 

CCCrACCATTTTCTACTTAGAI I 1 1 I I lAQGACAAATTTCrCOXTCTTTCTAAGCCTCCA 

65780 TCTAGCCCCAOTTGTTXIACrCTCTCnnGTAATAACC^ 

. CATAGCACATGGrrACTCCrCCOil IGICI IGbl IGIGCCAGCAAC ACIGGI I I ICXSCnT 

CR.I ICLIGLI IGI IGAGGTOVrTTCOVAGQCCCAGGTCI I IGIGLI I I I ICCCAAGGT 

CCCAGAGLI IGI ICCATACTCCCCrrACrTCCreAGAmAACTCTTCrcrcrrOVGOGC 

TTXnmGrAAGAAQGAQGO«jOVGO\GOVCrcra^^ 

[G.A] 

GAGrCAGACCATTOGATCTCAGCCCrACCATTfTCTACrrAGAI 1 1 1 II lAQG^^CAAATT 

, . TGrCCATCrnXTAAGCCTCCAATTGCrCACrfACAAAATTCAW 

, AAGATTOGTATCGAAOGTAATTAACCBCAGrATTTAGAAO^^^ 

mmOAro^TTACrATAGrrAQGACACTCACT^ 

. . . ; TAAAAGGG MG I IG ILII G GGCI ICI I G ^TAA A I G I r GiC CrnTACn^^^ 

66092 . . TTCGArCTGAGCCCTACCATTTTCTAGTTAGAi I I I II lAGGAGAAATTTCTCCATCTTT 
CtAAQCOCdVVTTQCrOtfTrACAAAATTWATAAC^^ 
GGAAQGTAATTAACOZAGrATTTAGAACATAGrAATTAATAAATAACT^^ 
ATTACTATACrrTAQGACACrCACTXjrTAGGnCTATACAAAGA^^ 
UGTCrrGGGLI ICI IGGAATAAAIGI IGlCCTTmaXjrATTTTAGAATATCATTGre 
. [G.A] __ 
GTCATAATIIbl I IGI IGlCATAATAATXW^AOkTACrreAATATTAAATTACCCTCT^ 
TTTATTTTTTAGCCAlTnTAGAAQGrrCCCCACAGCTGMTATQGTTtj^ 
. GMTTATTTCCAAAGMGGMTACCAQGACrrTACAGAQGCATCACCCCAAACrrc^^ 
AQGTGCrCCCTlKnX5TAQQCATOVCjrrATGl^^ 

GAGrAA<XCAGMATCATCrrQCAI I I I I IGCI I lAGCXHWAATTGAAACTfTCAACA 

66617 . ATCAAGOWXCrmGGAGrAACCCAGAAATCATGTTGOkl I I I I IGCI I lAGCCTGATA 
ATTXyW^CTTTCAACAATGrCTlQGAGTGACI I I 1 1 CTCCTCGAATTCAAACAAGTCrATG 
GCAAAAGAAGCTCG^I 1 1 1 1 I ICACAAAAGGGAAGATGGTAACAATXK^CACrTCAAAa: 
ttnOQGCTAAATTATATCrACAOVGAAAIGI ICAAAATCATAGTTTTAAIGIGI 1 1 IGAA 
AAQGCO^OVGAATTATACTTTATCI 11 ICI lAATAATXOGCAMTCTCTQCCXTCAATC 

. [C,T] 

GAMTaTWWVTCTACrQGGTGAACAAAAl IIGII I IGIGIGI lAGAOTTATAAATCA 
. TTAATCTTTATTTCQOGIGGI I lAGGnTATXKlOVGrra I ICI IGT 

mAtATATTTTCAAIGICI I lATAGAI I ICI I lAAATTTtrmTAGAAOCATTAATAG 
. AAMTO\TTAC:ATTTAAMTATACCTTACAGC^V\AGCATCCAMTAACnAT^^ 
TOTCCTTAI I 1 1 ICI I ICAGCPGAATACIGAATCAGCACAGTCCTQGAAI I ICIGAAGGGA 

66892 . ATCCTGCAAATCrCT«:CCTX3MTCGGAMTaGAAMTXnACT^^ 
GrmrnTJnGTTAGAGrrATAAATCA^ 

AGrrCCTTTATATTTAAAl I ICI IGI I I lATATAI I I IGAAIGICI I lATAGAI I ICI I I 
AAATTTCCTTATAGMCCATTAATAGAAAATCATTAOmTAAMTATACC^^ 
AAGCATCCAAATAACTATAGGGTITAIGICCI lAI 1 1 1 ICI I lONGCTCAATAGGAATCA 
. [G,A] 

CACftGnnQGTGGAATTTCTGAAGGGAAGTCATTWV\TTATATTT^ 
TCCATTTTACO\CTCrACCATTAI I IGGI I CCTGGAGTTATACACTAATmrOkGTATAT 
TAClXnTAMTTACCMCAOVAQQOVATmTTTCAAAGATTCa^ 



(SEQ ID NO: 117) 



(SEQ ID .N0:11S) 



(SEQ ID N0:1I9) 



(SEQ ID NO: 120) 
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ICil 1 1 ICCI I IGICCI I IGI I ICCTACCI 1 1 IGAATCAGATTCC3CmTTACTO«3GAAGA (SEQ ID NO: 121) 

67263 . CAOOTACCATTATTTQGrrCCreGAGnTATACACTMT^ ' 
TTACX>\ACAO\AQGCAATTTATTTCAAAGATrcGG^ 
. CAGCAQGAMCXSAAATCCrrreACTTOTATCAGCI ICIGCAGAGCAICI I ICil I I ICCTT 
. TXjrcdTTClTTCCTACat^^ — - - - 
CCATTGTAGTAAiXrGAAAl I ICI 1 1 i I IAATTIXArcAAGl^Tny\lD\TGAGCAA 
[G.A] 

TG AIbKaL I l A TTTCTCCCTCA L I G I I C AATATGTTGAA LI IGLIb l 1 1 I C AATATGGG 

... CAQCACAAAQGnyNGAGATACATATTAATAGTAGrATXnA™ 

CCTATATrrAAATTy^AAQGCCO^ATTTCTAAACATATAC^^ 

. AACTTTTAQGAACATXirrAGGATATAQCyVGACrrAATTTATAATAATX^^ 1 1 1 1 I 
TATTTTACTAAAGCXAI 1 1 1 lATAGTXZAACTATCI 1 1 ICI lAI I IGlGllaATTAQVAOT (SEQ ID NO: 122) 

67651 ATAGTAtnATXHATTACTCTrATACATTAGATACCrATATTT^ 

G^AAAC^TATA(:ATTCATATTCTa^CTTXK:CCCAAGTTT^^^ 

GAGACrtAATTTATAATAATGAGAGC Al I I H I l A TTTTACTAAAGCDynnnTATAGTC 

AACTATL! 1 1 ILIIAI I IGlGIGATTACWVCTTAGAAAAATATrrACTAGTTGAAGTTAT 

TATCAbI I 1 1 lAATTTAbl ICI lAAACTCAnTGXCTTCTAATAATrrCTXnTATAAATT 

. . [G,T] 

CCAGCATTrrAATXyVAAATCTAATCATffTAATAQGO^I III CI! lATrTCAACXTACXnx: 

TTTTAI 1 1 ICICiAACCAAAGAGAAAGATGGALIUilGI I IIjIGAAAOM I II lAAAAATC 
. TAGrTTCATTTATATTAGrr A I Cil I Kj ATAAATgrCTCAGr AI I I I I A TAATATCATAAG 

CCrGQGATTCTACrrTTAGQGTTATTTXnACI 1 1 ItaAGrrAATATATAAAGTCACAATATT 

AAQGTAC^TCATCAQCTOTTtrATTmACra;^^ (SEQ ID NO: 123) 

67935 A 1 1 lU GI lATAMTTGCCAQCATTTTAATCAAMTCTAA^ 

TTATTTGAACCTACCrCmTAI II ICItjAACCAAAGAGA(\AGATQGALIWjlGI I IGIG 

AAACAI I I I lAAAAATUTAGnTCATTTATATTAGTTAIbl I I (aATAAATCTCTCACjrAT 

TTTtATAATATWAAQCCTXiQGATTXTACrrmGQCJmTT^ 

. TATAAAGTX;A(^TATrAAQGtA(^TWCAQCra^ 

[C.T] • . . . . 

. . GGAAATGAATAATTTTXKTAACAACTrrGAAATTTOW\CTTCR5GAAAAT^^ 

TTCATTGTTCATrATXyVATTTAAAtTUrAAGGrATGAATGTTGA 1 1 I Ci I CTGTAGATCmG 

TATCrrrTCCAAAAAATGATrCrcrATLI I I ICiGAAAAAAGCGGAGAGfTGAAGATAGTA 

TATTTO«jrACnACnWVrATTTA(^ 

AAAATTAU IGI I I ICCACal I I 1 1 Al 1 1 1 1 1 1 lAGAGAAAATTCTTMGTCTCAlGrnTCC 

69000 TTCAGAAATAACmTOVCnTATTTCrcrAAQL 1 1 U I GCTTACCraGATACGreAO^QG 

TGAGATOQCTCTAQOVGACACTTSGOVGrrc 

TCt^ONAQGCAGCTCTXnXJTCCAATTtSZCAGCArC^ 

TOTTAGAAAAATQCreCCATAI I IGI 1 1 CrCACCTATTAGTCTTCTCrGGCAGrCAAGAG 

: AATAAATrnTCCAAGCAGAGATTOTACriTACAOT^^ 

[T.G] 

GTPGC^TlTTnAAAAATCTOKTVTOiCrrc 

I I llbl ICrCATCGAACtTCCI I I I 1 1 GAAAAGAGCACCAAAQGACTAAAAATACPGTGG 

AQQGAGCAACCCrCCrrTGCOVTATXCTCrCATTQGGAGACATlCnX]^^ 

CATTTAGGCCACrcrcraSGAGAGCACATCCTATGATC^^ 

ACPGlXKTCAASrCCAAGCTGACCAQCrTTCTl^ 

69134 cnnurccMTreccAGCATcnKTCcrcTWCT 

TCCCATAI I Itil I ICTO^CCTATTAG^CTTGTC^CCCACTCAAGAGAATAAATT^ATCCA 

AGCAGAGATTCrAanTACAGTAI I 11 GIL 1 1 ICaAGCrrOGCATTAGGrTCCATTPCTAA 

AAATCraOCATTSQCrrGCTXZATGCCaAATAGGAAC^^ I I IGI ICTCATG 

GAACrrCCrrTTTTGAAAAGAQCACXAAAQGAGTAAAAATACTCrrXK^^ 
[C.T] 

. CTrroCCATATXXnXTCATTTiQGAGACATXjre^^ 

TCTOGGAGftGOWIATOn'ATGAIGI ICICX:CAG(XTAGCCCCrR:CACnnGCrCAAGn: 



(SEQ ID NO: 124) 



(SEQ ip NO: 125) 
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CAAQCTX^CCAQCTTTCreACCACAGrcrAAACAAAG^ 
TCCTATACCCAGA (SEQ ID NO: 126) 
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